* Faculty       * Staff       * Students & Alumni       * Committees       * Contact       * Institute Directory
* Undergraduate Program       * Graduate Program       * Courses       * Institute Catalog      
* Undergraduate       * Graduate       * Institute Admissions: Undergraduate | Graduate      
* Colloquia       * Seminars       * News       * Events       * Institute Events      
* Overview       * Lab Manual       * Institute Computing      
No Menu Selected

* Research

Ph.D. Theses

Towards Fully-automatic and General-purpose Image Registration

By Gehua Yang
Advisor: Charles V. Stewart
June 12, 2007

Focusing on two image registration problems: 1) 2d-to-2d image alignment and 2) camera location estimation, we present two fully-automatic and general-purpose methods that share the same approach. The first method, named GDB-ICP, is an automated 2d-image-pair registration algorithm using a hypothesis-and-test strategy and an extension of the Dual-Bootstrap method for refinement. This algorithm is capable of aligning images taken of a wide variety of natural and man-made scenes as well as many medical images, tolerating low overlap, substantial orientation and scale differences, large illumination variations, and physical changes in the scene. An important component of this is the ability to automatically reject pairs that have no overlap or have too many differences to be aligned well. Experimental results on a data set of 22 challenging image pairs show that the algorithm effectively aligns 19 of the 22 pairs and rejects 99.8% of the misalignments that occur when all possible pairs are tried. An extension of the algorithmic principle of GDB-ICP, the second method, a 3d-to-2d alignment algorithm, is applied to estimate the location of a hand-held camera with respect to a 3d model augmented with texture information. Little prior knowledge is assumed about the camera location. A key issue is that initially the model-to-image mapping is well-approximated by a simple 2d-to-2d transformation based on a local model surface approximation. However, the algorithm must transition to the 3d-to-2d projection necessary to solve the position estimation problem. The experiments are conducted on a collection of 9 range scans and 60 image covering approximately a 100m x 100m region of RPI campus. The algorithm successfully and correctly determines the camera location of 52 images, while indicates it cannot find an alignment for the remaining 8. We will also briefly discuss the image-registration-related methods developed in the context of retinal image analysis.

* Return to main PhD Theses page