A unified scheme for image segmentation and object recognition
Qian, Richard Junqiang
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/19980
Description
Title
A unified scheme for image segmentation and object recognition
Author(s)
Qian, Richard Junqiang
Issue Date
1996
Doctoral Committee Chair(s)
Huang, Thomas S.
Department of Study
Electrical and Computer Engineering
Discipline
Electrical and Computer Engineering
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
Ph.D.
Degree Level
Dissertation
Keyword(s)
Engineering, Electronics and Electrical
Computer Science
Language
eng
Abstract
In this thesis, we present a unified scheme for image segmentation and object recognition. The scheme unifies image segmentation and object recognition via three serial stages: (1) optimal edge detection and region detection; (2) scale, position and orientation invariant object detection; and (3) high-level knowledge-based image segmentation.
In the first stage, edges are detected using a new edge detection algorithm and regions are extracted using a morphological-watershed-like segmentation algorithm. The new edge detection algorithm detects edges in an image with a curve-segment-based edge detection functional, which uses the Laplacian of Gaussian (LOG) zero-crossing contours as initial conditions to approach the true edge locations. The edge detection functional is shown to be optimal in terms of signal-to-noise ratio and edge localization accuracy for detecting general 2D edges. In addition, the resulting edge candidates preserve the nice scaling behavior of the LOG zero-crossing contours in scale space.
Based on the edge and region detection results from the first stage, the second stage detects objects of interest, e.g., human faces, using a new object detection algorithm. The new algorithm combines template matching methods with feature-based methods via hierarchical Markov random fields (MRFs) and maximum a posteriori probability (MAP) estimation. Hierarchical MRFs and MAP estimation provide a flexible framework to incorporate various visual clues. The combination of template matching and feature detection is shown to provide robustness for object detection against pose changes, complex background, and partial occlusions.
Finally based on the object detection results from the second stage, the third stage extracts the features of the detected targets, e.g., the eyes and the mouths of the detected faces, using a new deformable template matching algorithm. To reduce the probability of getting stuck in local minima, the new algorithm employs a coarse-to-fine scale space technique and uses the normalized cross-correlation to provide initial conditions for its deformation process. It also uses the optimal edge detection functional developed in this thesis to achieve the best accuracy for localizing the feature boundaries. Experimental results on real images from all the three stages are given in the thesis.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.