Instruction Induction: From Few Examples to Natural Language Task Descriptions

Or Honovich, Uri Shaham, Samuel R. Bowman, Omer Levy

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Large language models are able to perform a task by conditioning on a few input-output demonstrations - a paradigm known as in-context learning. We show that language models can explicitly infer an underlying task from a few demonstrations by prompting them to generate a natural language instruction that fits the examples. To explore this ability, we introduce the instruction induction challenge, compile a dataset consisting of 24 tasks, and define a novel evaluation metric based on executing the generated instruction. We discover that, to a large extent, the ability to generate instructions does indeed emerge when using a model that is both large enough and aligned to follow instructions; InstructGPT achieves 65.7% of human performance in our execution-based metric, while the original GPT-3 model reaches only 9.8% of human performance. This surprising result suggests that instruction induction might be a viable learning paradigm in and of itself, where instead of fitting a set of latent continuous parameters to the data, one searches for the best description in the natural language hypothesis space.

    Original languageEnglish (US)
    Title of host publicationLong Papers
    PublisherAssociation for Computational Linguistics (ACL)
    Pages1935-1952
    Number of pages18
    ISBN (Electronic)9781959429722
    StatePublished - 2023
    Event61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada
    Duration: Jul 9 2023Jul 14 2023

    Publication series

    NameProceedings of the Annual Meeting of the Association for Computational Linguistics
    Volume1
    ISSN (Print)0736-587X

    Conference

    Conference61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
    Country/TerritoryCanada
    CityToronto
    Period7/9/237/14/23

    ASJC Scopus subject areas

    • Computer Science Applications
    • Linguistics and Language
    • Language and Linguistics

    Fingerprint

    Dive into the research topics of 'Instruction Induction: From Few Examples to Natural Language Task Descriptions'. Together they form a unique fingerprint.

    Cite this