The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations

Nikita Nangia, Adina Williams, Angeliki Lazaridou, Samuel R. Bowman

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    This paper presents the results of the RepEval 2017 Shared Task, which evaluated neural network sentence representation learning models on the Multi- Genre Natural Language Inference corpus (MultiNLI) recently introduced by Williams et al. (2017). All of the five participating teams beat the bidirectional LSTM (BiLSTM) and continuous bag of words baselines reported in Williams et al.. The best single model used stacked BiLSTMs with residual connections to extract sentence features and reached 74.5% accuracy on the genre-matched test set. Surprisingly, the results of the competition were fairly consistent across the genrematched and genre-mismatched test sets, and across subsets of the test data representing a variety of linguistic phenomena, suggesting that all of the submitted systems learned reasonably domainindependent representations for sentence meaning.

    Original languageEnglish (US)
    Title of host publicationRepEval 2017 - 2nd Workshop on Evaluating Vector-Space Representations for NLP, Proceedings of the Workshop
    PublisherAssociation for Computational Linguistics (ACL)
    Pages1-10
    Number of pages10
    ISBN (Electronic)9781945626906
    StatePublished - 2017
    Event2nd Workshop on Evaluating Vector-Space Representations for NLP, RepEval 2017 - Copenhagen, Denmark
    Duration: Sep 8 2017 → …

    Publication series

    NameRepEval 2017 - 2nd Workshop on Evaluating Vector-Space Representations for NLP, Proceedings of the Workshop

    Conference

    Conference2nd Workshop on Evaluating Vector-Space Representations for NLP, RepEval 2017
    Country/TerritoryDenmark
    CityCopenhagen
    Period9/8/17 → …

    ASJC Scopus subject areas

    • Software
    • Computer Science Applications

    Fingerprint

    Dive into the research topics of 'The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations'. Together they form a unique fingerprint.

    Cite this