Arabic morphological tagging, diacritization, and lemmatization using lexeme models and feature ranking

Ryan Roth, Owen Rambow, Nizar Habash, Mona Diab, Cynthia Rudin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We investigate the tasks of general morphological tagging, diacritization, and lemmatization for Arabic. We show that for all tasks we consider, both modeling the lexeme explicitly, and retuning the weights of individual classifiers for the specific task, improve the performance.

Original languageEnglish (US)
Title of host publicationACL-08
Subtitle of host publicationHLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages117-120
Number of pages4
ISBN (Print)9781932432046
DOIs
StatePublished - 2008
Event46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT - Columbus, OH, United States
Duration: Jun 15 2008Jun 20 2008

Publication series

NameACL-08: HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference

Other

Other46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT
Country/TerritoryUnited States
CityColumbus, OH
Period6/15/086/20/08

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Networks and Communications
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Arabic morphological tagging, diacritization, and lemmatization using lexeme models and feature ranking'. Together they form a unique fingerprint.

Cite this