Abstract
This paper proposes a new task, cross-document event extraction and tracking, together with its evaluation metrics. We identify important person entities that are frequently involved in events as 'centroid entities'. We then link the events involving the same centroid entity along a timeline. We also present a system that performs this task and describe our current approaches to the main research challenges. We demonstrate that global inference from background knowledge and cross-document event aggregation are crucial to improving performance. This new task defines several extensions to the traditional single-document Information Extraction paradigm beyond 'slot filling'.
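The two steps the abstract describes (selecting centroid entities by how often they take part in events, then ordering each centroid's events in time) can be illustrated with a toy sketch. This is not the paper's system: the `Event` fields, the `top_k` cutoff, and the sample names and dates are all hypothetical, and the sketch assumes events have already been extracted with resolved person names and normalized dates.

```python
from collections import Counter, defaultdict
from datetime import date
from typing import NamedTuple


class Event(NamedTuple):
    # All fields are illustrative assumptions, not the paper's schema.
    doc_id: str        # source document the event was extracted from
    when: date         # normalized event date
    event_type: str    # illustrative label such as "Meet" or "Travel"
    persons: tuple     # person entities taking part in the event


def centroid_entities(events, top_k=1):
    """Rank persons by the number of events they take part in; keep top_k."""
    counts = Counter(p for ev in events for p in set(ev.persons))
    return [name for name, _ in counts.most_common(top_k)]


def build_timelines(events, top_k=1):
    """Group events by centroid entity and sort each group chronologically."""
    centroids = set(centroid_entities(events, top_k))
    timelines = defaultdict(list)
    for ev in events:
        for person in set(ev.persons) & centroids:
            timelines[person].append(ev)
    for person in timelines:
        timelines[person].sort(key=lambda ev: ev.when)
    return dict(timelines)


# Made-up events drawn from three hypothetical documents.
events = [
    Event("doc2", date(2009, 3, 5), "Meet", ("Alice", "Bob")),
    Event("doc1", date(2009, 1, 10), "Travel", ("Alice",)),
    Event("doc3", date(2009, 2, 1), "Meet", ("Carol",)),
]
timelines = build_timelines(events, top_k=1)
```

Here the person appearing in the most events is kept as the only centroid, and her events from different documents end up on one date-ordered timeline. A raw frequency count is the simplest possible selection rule and stands in for whatever ranking the actual system uses.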
Original language | English (US) |
---|---|
Title of host publication | International Conference Recent Advances in Natural Language Processing, RANLP |
Pages | 166-172 |
Number of pages | 7 |
State | Published - 2009 |
Event | International Conference on Recent Advances in Natural Language Processing, RANLP-2009 - Borovets, Bulgaria; Duration: Sep 14 2009 → Sep 16 2009 |
Other
Other | International Conference on Recent Advances in Natural Language Processing, RANLP-2009 |
---|---|
Country/Territory | Bulgaria |
City | Borovets |
Period | 9/14/09 → 9/16/09 |
Keywords
- Cross-document extraction
- Event
- Information extraction
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Software
- Electrical and Electronic Engineering