Abstract
We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible constellations of parts. A probabilistic representation is used for all aspects of the object: shape, appearance, occlusion and relative scale. An entropy-based feature detector is used to select regions and their scale within the image. In learning the parameters of the scale-invariant object model are estimated. This is done using expectation-maximization in a maximum-likelihood setting. In recognition, this model is used in a Bayesian manner to classify images. The flexible nature of the model is demonstrated by excellent results over a range of datasets including geometrically constrained classes (e.g. faces, cars) and flexible objects (such as animals).
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
Volume | 2 |
State | Published - 2003 |
Event | 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Madison, WI, United States Duration: Jun 18 2003 → Jun 20 2003 |
Other
Other | 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
Country/Territory | United States |
City | Madison, WI |
Period | 6/18/03 → 6/20/03 |
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Software
- Control and Systems Engineering
- Electrical and Electronic Engineering