Probing what different NLP tasks teach machines about function word comprehension

Najoung Kim, Roma Patel, Adam Poliak, Alex Wang, Patrick Xia, R. Thomas McCoy, Ian Tenney, Alexis Ross, Tal Linzen, Benjamin Van Durme, Samuel R. Bowman, Ellie Pavlick

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    We introduce a set of nine challenge tasks that test for the understanding of function words. These tasks are created by structurally mutating sentences from existing datasets to target the comprehension of specific types of function words (e.g., prepositions, wh-words). Using these probing tasks, we explore the effects of various pretraining objectives for sentence encoders (e.g., language modeling, CCG supertagging and natural language inference (NLI)) on the learned representations. Our results show that pretraining on language modeling performs the best on average across our probing tasks, supporting its widespread use for pretraining state-of-the-art NLP models, and CCG supertagging and NLI pretraining perform comparably. Overall, no pretraining objective dominates across the board, and our function word probing tasks highlight several intuitive differences between pretraining objectives, e.g., that NLI helps the comprehension of negation.

    Original languageEnglish (US)
    Title of host publication*SEM@NAACL-HLT 2019 - 8th Joint Conference on Lexical and Computational Semantics
    PublisherAssociation for Computational Linguistics (ACL)
    Pages235-249
    Number of pages15
    ISBN (Electronic)9781948087933
    StatePublished - 2019
    Event8th Joint Conference on Lexical and Computational Semantics, *SEM@NAACL-HLT 2019 - Minneapolis, United States
    Duration: Jun 6 2019Jun 7 2019

    Publication series

    Name*SEM@NAACL-HLT 2019 - 8th Joint Conference on Lexical and Computational Semantics

    Conference

    Conference8th Joint Conference on Lexical and Computational Semantics, *SEM@NAACL-HLT 2019
    Country/TerritoryUnited States
    CityMinneapolis
    Period6/6/196/7/19

    ASJC Scopus subject areas

    • Information Systems
    • Computer Science Applications
    • Computational Theory and Mathematics

    Fingerprint

    Dive into the research topics of 'Probing what different NLP tasks teach machines about function word comprehension'. Together they form a unique fingerprint.

    Cite this