How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech

Aditya Yedetore, Tal Linzen, Robert Frank, R. Thomas McCoy

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    When acquiring syntax, children consistently choose hierarchical rules over competing non-hierarchical possibilities. Is this preference due to a learning bias for hierarchical structure, or due to more general biases that interact with hierarchical cues in children's linguistic input? We explore these possibilities by training LSTMs and Transformers-two types of neural networks without a hierarchical bias-on data similar in quantity and content to children's linguistic input: text from the CHILDES corpus. We then evaluate what these models have learned about English yes/no questions, a phenomenon for which hierarchical structure is crucial. We find that, though they perform well at capturing the surface statistics of child-directed speech (as measured by perplexity), both model types generalize in a way more consistent with an incorrect linear rule than the correct hierarchical rule. These results suggest that human-like generalization from text alone requires stronger biases than the general sequence-processing biases of standard neural network architectures.

    Original languageEnglish (US)
    Title of host publicationLong Papers
    PublisherAssociation for Computational Linguistics (ACL)
    Pages9370-9393
    Number of pages24
    ISBN (Electronic)9781959429722
    StatePublished - 2023
    Event61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada
    Duration: Jul 9 2023Jul 14 2023

    Publication series

    NameProceedings of the Annual Meeting of the Association for Computational Linguistics
    Volume1
    ISSN (Print)0736-587X

    Conference

    Conference61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
    Country/TerritoryCanada
    CityToronto
    Period7/9/237/14/23

    ASJC Scopus subject areas

    • Computer Science Applications
    • Linguistics and Language
    • Language and Linguistics

    Fingerprint

    Dive into the research topics of 'How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech'. Together they form a unique fingerprint.

    Cite this