Learning object categories from Google's image search

R. Fergus, L. Fei-Fei, P. Perona, A. Zisserman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Current approaches to object category recognition require datasets of training images to be manually prepared, with varying degrees of supervision. We present an approach that can learn an object category from just its name, by utilizing the raw output of image search engines available on the Internet. We develop a new model, TSI-pLSA, which extends pLSA (as applied to visual words) to include spatial information in a translation and scale invariant manner. Our approach can handle the high infra-class variability and large proportion of unrelated images returned by search engines. We evaluate the models on standard test sets, showing performance competitive with existing methods trained on hand prepared datasets.

Original languageEnglish (US)
Title of host publicationProceedings - 10th IEEE International Conference on Computer Vision, ICCV 2005
Pages1816-1823
Number of pages8
DOIs
StatePublished - 2005
EventProceedings - 10th IEEE International Conference on Computer Vision, ICCV 2005 - Beijing, China
Duration: Oct 17 2005Oct 20 2005

Publication series

NameProceedings of the IEEE International Conference on Computer Vision
VolumeII

Other

OtherProceedings - 10th IEEE International Conference on Computer Vision, ICCV 2005
Country/TerritoryChina
CityBeijing
Period10/17/0510/20/05

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Learning object categories from Google's image search'. Together they form a unique fingerprint.

Cite this