Twenty-five years of information extraction

Research output: Contribution to journalReview article

Abstract

Information extraction is the process of converting unstructured text into a structured data base containing selected information from the text. It is an essential step in making the information content of the text usable for further processing. In this paper, we describe how information extraction has changed over the past 25 years, moving from hand-coded rules to neural networks, with a few stops on the way. We connect these changes to research advances in NLP and to the evaluations organized by the US Government.

Original languageEnglish (US)
Pages (from-to)677-692
Number of pages16
JournalNatural Language Engineering
Volume25
Issue number6
DOIs
StatePublished - Nov 1 2019

Keywords

  • Information extraction
  • Message understanding

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Linguistics and Language
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Twenty-five years of information extraction'. Together they form a unique fingerprint.

Cite this