TY - JOUR
T1 - Optimal analysis of subset-selection based lp low-rank approximation
AU - Dan, Chen
AU - Zhang, Hongyang
AU - Wang, Hong
AU - Zhou, Yuchen
AU - Ravikumar, Pradeep
N1 - Funding Information:
C.D. and P.R. acknowledge the support of Rakuten Inc., and NSF via IIS1909816. The authors would also like to acknowledge two MathOverflow users, known to us only by their usernames, ’fedja’ and ’Mahdi’, for informing us the Riesz-Thorin interpolation theorem.
Publisher Copyright:
© 2019 Neural information processing systems foundation. All rights reserved.
PY - 2019
Y1 - 2019
N2 - We study the low rank approximation problem of any given matrix A over Rn×m and Cn×m in entry-wise lp loss, that is, finding a rank-k matrix X such that kA - Xkp is minimized. Unlike the traditional l2 setting, this particular variant is NP-Hard. We show that the algorithm of column subset selection, which was an algorithmic foundation of many existing algorithms, enjoys approximation ratio (k + 1)1/p for 1 = p = 2 and (k + 1)1-1/p for p = 2. This improves upon the previous O(k + 1) bound for p = 1 [1]. We complement our analysis with lower bounds; these bounds match our upper bounds up to constant 1 when p = 2. At the core of our techniques is an application of Riesz-Thorin interpolation theorem from harmonic analysis, which might be of independent interest to other algorithmic designs and analysis more broadly. As a consequence of our analysis, we provide better approximation guarantees for several other algorithms with various time complexity. For example, to make the algorithm of column subset selection computationally efficient, we analyze a polynomial time bi-criteria algorithm which selects O(k log m) columns. We show that this algorithm has an approximation ratio of O((k + 1)1/p) for 1 = p = 2 and O((k + 1)1-1/p) for p = 2. This improves over the best-known bound with an O(k + 1) approximation ratio. Our bi-criteria algorithm also implies an exact-rank method in polynomial time with a slightly larger approximation ratio.
AB - We study the low rank approximation problem of any given matrix A over Rn×m and Cn×m in entry-wise lp loss, that is, finding a rank-k matrix X such that kA - Xkp is minimized. Unlike the traditional l2 setting, this particular variant is NP-Hard. We show that the algorithm of column subset selection, which was an algorithmic foundation of many existing algorithms, enjoys approximation ratio (k + 1)1/p for 1 = p = 2 and (k + 1)1-1/p for p = 2. This improves upon the previous O(k + 1) bound for p = 1 [1]. We complement our analysis with lower bounds; these bounds match our upper bounds up to constant 1 when p = 2. At the core of our techniques is an application of Riesz-Thorin interpolation theorem from harmonic analysis, which might be of independent interest to other algorithmic designs and analysis more broadly. As a consequence of our analysis, we provide better approximation guarantees for several other algorithms with various time complexity. For example, to make the algorithm of column subset selection computationally efficient, we analyze a polynomial time bi-criteria algorithm which selects O(k log m) columns. We show that this algorithm has an approximation ratio of O((k + 1)1/p) for 1 = p = 2 and O((k + 1)1-1/p) for p = 2. This improves over the best-known bound with an O(k + 1) approximation ratio. Our bi-criteria algorithm also implies an exact-rank method in polynomial time with a slightly larger approximation ratio.
UR - http://www.scopus.com/inward/record.url?scp=85090174569&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85090174569&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85090174569
SN - 1049-5258
VL - 32
JO - Advances in Neural Information Processing Systems
JF - Advances in Neural Information Processing Systems
T2 - 33rd Annual Conference on Neural Information Processing Systems, NeurIPS 2019
Y2 - 8 December 2019 through 14 December 2019
ER -