Large-scale SVD and manifold learning

Ameet Talwalkar, Sanjiv Kumar, Mehryar Mohri, Henry Rowley

Research output: Contribution to journalArticlepeer-review

Abstract

Abstract This paper examines the efficacy of sampling-based low-rank approximation techniques when applied to large dense kernel matrices. We analyze two common approximate singular value decomposition techniques, namely the Nystr om and Column sampling methods. We present a theoretical comparison between these two methods, provide novel insights regarding their suitability for various tasks and present experimental results that support our theory. Our results illustrate the relative strengths of each method. We next examine the performance of these two techniques on the largescale task of extracting low-dimensional manifold structure given millions of high-dimensional face images. We address the computational challenges of non-linear dimensionality reduction via Isomap and Laplacian Eigenmaps, using a graph containing about 18 million nodes and 65 million edges. We present extensive experiments on learning low-dimensional embeddings for two large face data sets: CMU-PIE (35 thousand faces) and a web data set (18 million faces). Our comparisons show that the Nystr om approximation is superior to the Column sampling method for this task. Furthermore, approximate Isomap tends to perform better than Laplacian Eigenmaps on both clustering and classification with the labeled CMU-PIE data set.

Original languageEnglish (US)
Pages (from-to)3129-3152
Number of pages24
JournalJournal of Machine Learning Research
Volume14
StatePublished - Oct 2013

Keywords

  • Large-scale matrix factorization
  • Low-rank approximation
  • Manifold learning

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Large-scale SVD and manifold learning'. Together they form a unique fingerprint.

Cite this