TY - GEN
T1 - Gaussian-Bernoulli deep Boltzmann machine
AU - Cho, Kyung Hyun
AU - Raiko, Tapani
AU - Ilin, Alexander
N1 - Copyright:
Copyright 2014 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - In this paper, we study a model that we call Gaussian-Bernoulli deep Boltzmann machine (GDBM) and discuss potential improvements in training the model. GDBM is designed to be applicable to continuous data and it is constructed from Gaussian-Bernoulli restricted Boltzmann machine (GRBM) by adding multiple layers of binary hidden neurons. The studied improvements of the learning algorithm for GDBM include parallel tempering, enhanced gradient, adaptive learning rate and layer-wise pretraining. We empirically show that they help avoid some of the common difficulties found in training deep Boltzmann machines such as divergence of learning, the difficulty in choosing right learning rate scheduling, and the existence of meaningless higher layers.
AB - In this paper, we study a model that we call Gaussian-Bernoulli deep Boltzmann machine (GDBM) and discuss potential improvements in training the model. GDBM is designed to be applicable to continuous data and it is constructed from Gaussian-Bernoulli restricted Boltzmann machine (GRBM) by adding multiple layers of binary hidden neurons. The studied improvements of the learning algorithm for GDBM include parallel tempering, enhanced gradient, adaptive learning rate and layer-wise pretraining. We empirically show that they help avoid some of the common difficulties found in training deep Boltzmann machines such as divergence of learning, the difficulty in choosing right learning rate scheduling, and the existence of meaningless higher layers.
UR - http://www.scopus.com/inward/record.url?scp=84893549229&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893549229&partnerID=8YFLogxK
U2 - 10.1109/IJCNN.2013.6706831
DO - 10.1109/IJCNN.2013.6706831
M3 - Conference contribution
AN - SCOPUS:84893549229
SN - 9781467361293
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2013 International Joint Conference on Neural Networks, IJCNN 2013
T2 - 2013 International Joint Conference on Neural Networks, IJCNN 2013
Y2 - 4 August 2013 through 9 August 2013
ER -