TY - GEN
T1 - A sparse and locally shift invariant feature extractor applied to document images
AU - Ranzato, Marc Aurelio
AU - LeCun, Yann
PY - 2007
Y1 - 2007
N2 - We describe an unsupervised learning algorithm for extracting sparse and locally shift-invariant features. We also devise a principled procedure for learning hierarchies of invariant features. Each feature detector is composed of a set of trainable convolutional filters followed by a max-pooling layer over non-overlapping windows, and a point-wise sigmoid non-linearity. A second stage of more invariant features is fed with patches provided by the first stage feature extractor, and is trained in the same way. The method is used to pre-train the first four layers of a deep convolutional network which achieves state-of-the-art performance on the MNIST dataset of handwritten digits. The final testing error rate is equal to 0.42%. Preliminary experiments on compression of bitonal document images show very promising results in terms of compression ratio and reconstruction error.
AB - We describe an unsupervised learning algorithm for extracting sparse and locally shift-invariant features. We also devise a principled procedure for learning hierarchies of invariant features. Each feature detector is composed of a set of trainable convolutional filters followed by a max-pooling layer over non-overlapping windows, and a point-wise sigmoid non-linearity. A second stage of more invariant features is fed with patches provided by the first stage feature extractor, and is trained in the same way. The method is used to pre-train the first four layers of a deep convolutional network which achieves state-of-the-art performance on the MNIST dataset of handwritten digits. The final testing error rate is equal to 0.42%. Preliminary experiments on compression of bitonal document images show very promising results in terms of compression ratio and reconstruction error.
UR - http://www.scopus.com/inward/record.url?scp=51149113745&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51149113745&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2007.4377108
DO - 10.1109/ICDAR.2007.4377108
M3 - Conference contribution
AN - SCOPUS:51149113745
SN - 0769528228
SN - 9780769528229
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 1213
EP - 1217
BT - Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
T2 - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
Y2 - 23 September 2007 through 26 September 2007
ER -