TY - GEN
T1 - Learning fast approximations of sparse coding
AU - Gregor, Karol
AU - LeCun, Yann
PY - 2010
Y1 - 2010
N2 - In Sparse Coding (SC), input vectors are reconstructed using a sparse linear combination of basis vectors. SC has become a popular method for extracting features from data. For a given input, SC minimizes a quadratic reconstruction error with an L1 penalty term on the code. The process is often too slow for applications such as real-time pattern recognition. We proposed two versions of a very fast algorithm that produces approximate estimates of the sparse code that can be used to compute good visual features, or to initialize exact iterative algorithms. The main idea is to train a non-linear, feed-forward predictor with a specific architecture and a fixed depth to produce the best possible approximation of the sparse code. A version of the method, which can be seen as a trainable version of Li and Osher's coordinate descent method, is shown to produce approximate solutions with 10 times less computation than Li and Osher's for the same approximation error. Unlike previous proposals for sparse code predictors, the system allows a kind of approximate "explaining away" to take place during inference. The resulting predictor is differ- entiable and can be included into globally-trained recognition systems.
AB - In Sparse Coding (SC), input vectors are reconstructed using a sparse linear combination of basis vectors. SC has become a popular method for extracting features from data. For a given input, SC minimizes a quadratic reconstruction error with an L1 penalty term on the code. The process is often too slow for applications such as real-time pattern recognition. We proposed two versions of a very fast algorithm that produces approximate estimates of the sparse code that can be used to compute good visual features, or to initialize exact iterative algorithms. The main idea is to train a non-linear, feed-forward predictor with a specific architecture and a fixed depth to produce the best possible approximation of the sparse code. A version of the method, which can be seen as a trainable version of Li and Osher's coordinate descent method, is shown to produce approximate solutions with 10 times less computation than Li and Osher's for the same approximation error. Unlike previous proposals for sparse code predictors, the system allows a kind of approximate "explaining away" to take place during inference. The resulting predictor is differ- entiable and can be included into globally-trained recognition systems.
UR - http://www.scopus.com/inward/record.url?scp=77956515664&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956515664&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:77956515664
SN - 9781605589077
T3 - ICML 2010 - Proceedings, 27th International Conference on Machine Learning
SP - 399
EP - 406
BT - ICML 2010 - Proceedings, 27th International Conference on Machine Learning
T2 - 27th International Conference on Machine Learning, ICML 2010
Y2 - 21 June 2010 through 25 June 2010
ER -