TY - JOUR
T1 - Unsupervised learning of hierarchical representations with convolutional deep belief networks
AU - Lee, Honglak
AU - Grosse, Roger
AU - Ranganath, Rajesh
AU - Ng, Andrew Y.
PY - 2011/10
Y1 - 2011/10
N2 - There has been much interest in unsupervised learning of hierarchical generative models such as deep belief networks (DBNs); however, scaling such models to full-sized, highdimensional images remains a difficult problem. To address this problem, we present the convolutional deep belief network, a hierarchical generative model that scales to realistic image sizes. This model is translation-invariant and supports efficient bottom-up and top-down probabilistic inference. Key to our approach is probabilistic max-pooling, a novel technique that shrinks the representations of higher layers in a probabilistically sound way. Our experiments show that the algorithm learns useful high-level visual features, such as object parts, from unlabeled images of objects and natural scenes. We demonstrate excellent performance on several visual recognition tasks and show that our model can perform hierarchical (bottom-up and top-down) inference over full-sized images.
AB - There has been much interest in unsupervised learning of hierarchical generative models such as deep belief networks (DBNs); however, scaling such models to full-sized, highdimensional images remains a difficult problem. To address this problem, we present the convolutional deep belief network, a hierarchical generative model that scales to realistic image sizes. This model is translation-invariant and supports efficient bottom-up and top-down probabilistic inference. Key to our approach is probabilistic max-pooling, a novel technique that shrinks the representations of higher layers in a probabilistically sound way. Our experiments show that the algorithm learns useful high-level visual features, such as object parts, from unlabeled images of objects and natural scenes. We demonstrate excellent performance on several visual recognition tasks and show that our model can perform hierarchical (bottom-up and top-down) inference over full-sized images.
UR - http://www.scopus.com/inward/record.url?scp=80053540444&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053540444&partnerID=8YFLogxK
U2 - 10.1145/2001269.2001295
DO - 10.1145/2001269.2001295
M3 - Article
AN - SCOPUS:80053540444
SN - 0001-0782
VL - 54
SP - 95
EP - 103
JO - Communications of the ACM
JF - Communications of the ACM
IS - 10
ER -