Correction annotation for non-native Arabic texts: Guidelines and corpus

Wajdi Zaghouani, Nizar Habash, Houda Bouamor, Alla Rozovskaya, Behrang Mohit, Abeer Heider, Kemal Oflazer

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present our correction annotation guidelines to create a manually corrected nonnative (L2) Arabic corpus. We develop our approach by extending an L1 large-scale Arabic corpus and its manual corrections, to include manually corrected non-native Arabic learner essays. Our overarching goal is to use the annotated corpus to develop components for automatic detection and correction of language errors that can be used to help Standard Arabic learners (native and non-native) improve the quality of the Arabic text they produce. The created corpus of L2 text manual corrections is the largest to date. We evaluate our guidelines using inter-annotator agreement and show a high degree of consistency.

Original languageEnglish (US)
Title of host publicationLAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop
EditorsAdam Meyers, Ines Rehbein, Heike Zinsmeister
PublisherAssociation for Computational Linguistics (ACL)
Pages129-139
Number of pages11
ISBN (Electronic)9781941643471
StatePublished - 2020
Event9th Linguistic Annotation Workshop, LAW 2015, held in conjuncion with NAACL 2015 - Denver, United States
Duration: Jun 5 2015 → …

Publication series

NameLAW 2015 - 9th Linguistic Annotation Workshop, held in conjuncion with NAACL 2015 - Proceedings of the Workshop

Conference

Conference9th Linguistic Annotation Workshop, LAW 2015, held in conjuncion with NAACL 2015
CountryUnited States
CityDenver
Period6/5/15 → …

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Correction annotation for non-native Arabic texts: Guidelines and corpus'. Together they form a unique fingerprint.

Cite this