Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine

Ramy Eskander, Nizar Habash, Owen Rambow, Arfath Pasha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

Original languageEnglish (US)
Title of host publicationCOLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016
Subtitle of host publicationTechnical Papers
PublisherAssociation for Computational Linguistics, ACL Anthology
Pages3455-3465
Number of pages11
ISBN (Print)9784879747020
StatePublished - 2016
Event26th International Conference on Computational Linguistics, COLING 2016 - Osaka, Japan
Duration: Dec 11 2016Dec 16 2016

Publication series

NameCOLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers

Other

Other26th International Conference on Computational Linguistics, COLING 2016
Country/TerritoryJapan
CityOsaka
Period12/11/1612/16/16

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine'. Together they form a unique fingerprint.

Cite this