Missed Connections: Lateral Thinking Puzzles for Large Language Models

Graham Todd, Tim Merino, Sam Earle, Julian Togelius

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    Abstract

    The Connections puzzle published each day by the New York Times tasks players with dividing a bank of sixteen words into four groups of four words that each relate to a common theme. Solving the puzzle requires both common linguistic knowledge (i.e., definitions and typical usage) and, in many cases, lateral or abstract thinking. This is because the four categories ascend in complexity, with the most challenging category often requiring thinking about words in uncommon ways or as parts of larger phrases. We investigate the capacity for automated AI systems to play Connections and explore the game's potential as an automated benchmark for abstract reasoning and a way to measure the semantic information encoded by data-driven linguistic systems. In particular, we study both a sentence-embedding baseline and modern large language models (LLMs). We report their accuracy on the task, measure the impacts of chain-of-thought prompting, and discuss their failure modes. Overall, we find that the Connections task is challenging yet feasible, and a strong test-bed for future work.

    Original language: English (US)
    Title of host publication: Proceedings of the 2024 IEEE Conference on Games, CoG 2024
    Publisher: IEEE Computer Society
    ISBN (Electronic): 9798350350678
    DOIs
    State: Published - 2024
    Event: 6th Annual IEEE Conference on Games, CoG 2024 - Milan, Italy
    Duration: Aug 5 2024 to Aug 8 2024

    Publication series

    Name: IEEE Conference on Computational Intelligence and Games, CIG
    ISSN (Print): 2325-4270
    ISSN (Electronic): 2325-4289

    Conference

    Conference: 6th Annual IEEE Conference on Games, CoG 2024
    Country/Territory: Italy
    City: Milan
    Period: 8/5/24 to 8/8/24

    Keywords

    • AI
    • evaluation
    • Language models
    • reasoning

    ASJC Scopus subject areas

    • Artificial Intelligence
    • Computer Graphics and Computer-Aided Design
    • Computer Vision and Pattern Recognition
    • Human-Computer Interaction
    • Software
