TY - JOUR
T1 - Learning object categories from internet image searches
AU - Fergus, Rob
AU - Fei-Fei, Li
AU - Perona, Pietro
AU - Zisserman, Andrew
N1 - Funding Information:
Dr. Fei-Fei is a recipient of a Microsoft Research New Faculty award, a Google research award and a National Science Foundation (NSF) CAREER award.
Funding Information:
Manuscript received April 7, 2009; revised September 23, 2009; accepted March 22, 2010. Date of publication June 10, 2010; date of current version July 21, 2010. This work was supported by the Caltech Center for Neuromorphic Systems Engineering (CNSE), the U.K. Engineering and Physical Sciences Research Council (EPSRC), European Union NOE PASCAL, the European Research Council (ERC) under Grant VisRec, and the U.S. Office of Naval Research (ONR) Multidisciplinary University Research Initiative (MURI) under Grants N00014-06-1-0734 and N00014-07-1-0182. R. Fergus is with the Department of Computer Science, Courant Institute, New York University, New York, NY 10003 USA (e-mail: [email protected]). L. Fei-Fei is with the Department of Computer Science, Stanford University, Stanford, CA 94305 USA. P. Perona is with the Department of Electrical Engineering, California Institute of Technology (Caltech), Pasadena, CA 91125 USA. A. Zisserman is with the Department of Engineering Science, University of Oxford, Oxford OX1 3PJ, U.K.
PY - 2010/8
Y1 - 2010/8
N2 - In this paper, we describe a simple approach to learning models of visual object categories from images gathered from Internet image search engines. The images for a given keyword are typically highly variable, with a large fraction being unrelated to the query term, and thus pose a challenging environment from which to learn. By training our models directly from Internet images, we remove the need to laboriously compile training data sets, required by most other recognition approachesthis opens up the possibility of learning object category models on-the-fly. We describe two simple approaches, derived from the probabilistic latent semantic analysis (pLSA) technique for text document analysis, that can be used to automatically learn object models from these data. We show two applications of the learned model: first, to rerank the images returned by the search engine, thus improving the quality of the search engine; and second, to recognize objects in other image data sets.
AB - In this paper, we describe a simple approach to learning models of visual object categories from images gathered from Internet image search engines. The images for a given keyword are typically highly variable, with a large fraction being unrelated to the query term, and thus pose a challenging environment from which to learn. By training our models directly from Internet images, we remove the need to laboriously compile training data sets, required by most other recognition approachesthis opens up the possibility of learning object category models on-the-fly. We describe two simple approaches, derived from the probabilistic latent semantic analysis (pLSA) technique for text document analysis, that can be used to automatically learn object models from these data. We show two applications of the learned model: first, to rerank the images returned by the search engine, thus improving the quality of the search engine; and second, to recognize objects in other image data sets.
KW - Internet image search engines
KW - Learning
KW - Object categories
KW - Recognition
KW - Unsupervised
UR - http://www.scopus.com/inward/record.url?scp=77954860650&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77954860650&partnerID=8YFLogxK
U2 - 10.1109/JPROC.2010.2048990
DO - 10.1109/JPROC.2010.2048990
M3 - Article
AN - SCOPUS:77954860650
SN - 0018-9219
VL - 98
SP - 1453
EP - 1466
JO - Proceedings of the IEEE
JF - Proceedings of the IEEE
IS - 8
M1 - 5483225
ER -