Jump to better conclusions: SCAN both left and right

Jasmijn Bastings, Marco Baroni, Jason Weston, Kyunghyun Cho, Douwe Kiela

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Lake and Baroni (2018) recently introduced the SCAN data set, which consists of simple commands paired with action sequences and is intended to test the strong generalization abilities of recurrent sequence-to-sequence models. Their initial experiments suggested that such models may fail because they lack the ability to extract systematic rules. Here, we take a closer look at SCAN and show that it does not always capture the kind of generalization that it was designed for. To mitigate this we propose a complementary dataset, which requires mapping actions back to the original commands, called NACS. We show that models that do well on SCAN do not necessarily do well on NACS, and that NACS exhibits properties more closely aligned with realistic use-cases for sequence-to-sequence models.

Original languageEnglish (US)
Title of host publicationEMNLP 2018 - 2018 EMNLP Workshop BlackboxNLP
Subtitle of host publicationAnalyzing and Interpreting Neural Networks for NLP, Proceedings of the 1st Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages47-55
Number of pages9
ISBN (Electronic)9781948087711
StatePublished - 2018
Event1st Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, co-located with the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 - Brussels, Belgium
Duration: Nov 1 2018 → …

Publication series

NameEMNLP 2018 - 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Proceedings of the 1st Workshop

Conference

Conference1st Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, co-located with the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
Country/TerritoryBelgium
CityBrussels
Period11/1/18 → …

ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Theory and Mathematics
  • Information Systems

Fingerprint

Dive into the research topics of 'Jump to better conclusions: SCAN both left and right'. Together they form a unique fingerprint.

Cite this