TY - GEN
T1 - Learning Inductive Biases with Simple Neural Networks
AU - Feinman, Reuben
AU - Lake, Brenden M.
N1 - Publisher Copyright:
© 2018 Proceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018. All rights reserved.
PY - 2018
Y1 - 2018
N2 - People use rich prior knowledge about the world in order to efficiently learn new concepts. These priors-also known as “inductive biases”-pertain to the space of internal models considered by a learner, and they help the learner make inferences that go beyond the observed data. A recent study found that deep neural networks optimized for object recognition develop the shape bias (Ritter et al., 2017), an inductive bias possessed by children that plays an important role in early word learning. However, these networks use unrealistically large quantities of training data, and the conditions required for these biases to develop are not well understood. Moreover, it is unclear how the learning dynamics of these networks relate to developmental processes in childhood. We investigate the development and influence of the shape bias in neural networks using controlled datasets of abstract patterns and synthetic images, allowing us to systematically vary the quantity and form of the experience provided to the learning algorithms. We find that simple neural networks develop a shape bias after seeing as few as 3 examples of 4 object categories. The development of these biases predicts the onset of vocabulary acceleration in our networks, consistent with the developmental process in children.
AB - People use rich prior knowledge about the world in order to efficiently learn new concepts. These priors-also known as “inductive biases”-pertain to the space of internal models considered by a learner, and they help the learner make inferences that go beyond the observed data. A recent study found that deep neural networks optimized for object recognition develop the shape bias (Ritter et al., 2017), an inductive bias possessed by children that plays an important role in early word learning. However, these networks use unrealistically large quantities of training data, and the conditions required for these biases to develop are not well understood. Moreover, it is unclear how the learning dynamics of these networks relate to developmental processes in childhood. We investigate the development and influence of the shape bias in neural networks using controlled datasets of abstract patterns and synthetic images, allowing us to systematically vary the quantity and form of the experience provided to the learning algorithms. We find that simple neural networks develop a shape bias after seeing as few as 3 examples of 4 object categories. The development of these biases predicts the onset of vocabulary acceleration in our networks, consistent with the developmental process in children.
KW - inductive biases
KW - learning-to-learn
KW - neural networks
KW - word learning
UR - http://www.scopus.com/inward/record.url?scp=85093249454&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093249454&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85093249454
T3 - Proceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018
SP - 1657
EP - 1662
BT - Proceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018
PB - The Cognitive Science Society
T2 - 40th Annual Meeting of the Cognitive Science Society: Changing Minds, CogSci 2018
Y2 - 25 July 2018 through 28 July 2018
ER -