LTAG-spinal and the Treebank: A new resource for incremental, dependency and semantic parsing

Libin Shen, Lucas Champollion, Aravind K. Joshi

    Research output: Contribution to journal, Article, peer-review

    Abstract

    We introduce LTAG-spinal, a novel variant of traditional Lexicalized Tree Adjoining Grammar (LTAG) with desirable linguistic, computational and statistical properties. Unlike in traditional LTAG, subcategorization frames and the argument-adjunct distinction are left underspecified in LTAG-spinal. LTAG-spinal with adjunction constraints is weakly equivalent to LTAG. The LTAG-spinal formalism is used to extract an LTAG-spinal Treebank from the Penn Treebank (PTB) with Propbank annotation. Based on Propbank annotation, predicate coordination and LTAG adjunction structures are successfully extracted. The LTAG-spinal Treebank makes explicit semantic relations that are implicit in or absent from the original PTB. LTAG-spinal provides a very desirable resource for statistical LTAG parsing, incremental parsing, dependency parsing, and semantic parsing. This treebank has been successfully used to train an incremental LTAG-spinal parser and a bidirectional LTAG dependency parser.

    Original language: English (US)
    Pages (from-to): 1-19
    Number of pages: 19
    Journal: Language Resources and Evaluation
    Volume: 42
    Issue number: 1
    State: Published - Mar 2008

    Keywords

    • Dependency parsing
    • LTAG-spinal
    • Tree adjoining grammar
    • Treebank

    ASJC Scopus subject areas

    • Language and Linguistics
    • Education
    • Linguistics and Language
    • Library and Information Sciences
