Measuring the usefulness of hidden units in Boltzmann machines with mutual information

Mathias Berglund, Tapani Raiko, Kyunghyun Cho

Research output: Contribution to journal › Article › peer-review

Abstract

Restricted Boltzmann machines (RBMs) and deep Boltzmann machines (DBMs) are important models in deep learning, but it is often difficult to measure their performance in general, or the importance of individual hidden units in particular. We propose to use mutual information to measure the usefulness of individual hidden units in Boltzmann machines. The measure is fast to compute and serves as an upper bound on the information a neuron can pass on, enabling the detection of a particular kind of poor training result. We confirm experimentally that the proposed measure indicates how much the performance of the model drops when some of the units of an RBM are pruned away. We also demonstrate the usefulness of the measure for early detection of poor training in DBMs.
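The abstract does not spell out the estimator, but for a hidden unit that is conditionally Bernoulli given the visible input v, the mutual information with the data can be written as I(h_j; v) = H(h_j) − H(h_j | v), i.e. the entropy of the unit's mean activation minus its mean conditional entropy over the data. The following is a minimal NumPy sketch under that assumption; the function names and the pruning-style ranking at the end are illustrative, not the authors' code, and the exact measure used in the paper may differ.

```python
import numpy as np

def binary_entropy(p, eps=1e-12):
    """Entropy (in bits) of a Bernoulli variable with success probability p."""
    p = np.clip(p, eps, 1.0 - eps)
    return -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))

def hidden_unit_mutual_information(hidden_probs):
    """Per-unit mutual information I(h_j; v) for binary stochastic hidden units.

    hidden_probs: array of shape (n_samples, n_hidden) holding the conditional
    activation probabilities p(h_j = 1 | v_n) of an RBM over a data sample.

    For a conditionally Bernoulli unit,
    I(h_j; v) = H(h_j) - H(h_j | v)
              = H(mean activation) - mean(H(activation)),
    estimated over the data sample.
    """
    mean_activation = hidden_probs.mean(axis=0)                       # marginal p(h_j = 1)
    marginal_entropy = binary_entropy(mean_activation)                # H(h_j)
    conditional_entropy = binary_entropy(hidden_probs).mean(axis=0)   # H(h_j | v)
    return marginal_entropy - conditional_entropy                     # shape: (n_hidden,)

if __name__ == "__main__":
    # Stand-in for p(h | v) computed by a trained RBM on 1000 data points.
    rng = np.random.default_rng(0)
    probs = rng.uniform(size=(1000, 16))
    mi = hidden_unit_mutual_information(probs)
    # Units with the lowest score pass on the least information and are
    # natural pruning candidates under this measure.
    print("least useful units first:", np.argsort(mi)[:5])
```

The score is bounded above by one bit per binary unit, which is consistent with its use as an upper bound on how much information a single hidden unit can transmit.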

Original language: English (US)
Pages (from-to): 12-18
Number of pages: 7
Journal: Neural Networks
Volume: 64
State: Published - Apr 1 2015

Keywords

  • Deep Boltzmann machine
  • Deep learning
  • Mutual information
  • Pruning
  • Restricted Boltzmann machine
  • Structural learning

ASJC Scopus subject areas

  • Cognitive Neuroscience
  • Artificial Intelligence
