Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop

Nizar Habash, Owen Rambow

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We present an approach to using a morphological analyzer for tokenizing and morphologically tagging (including part-of-speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.
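
The selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes one simple combination rule (pick the analyzer entry that agrees with the largest number of per-feature classifier predictions), and the function name `choose_analysis` and the feature names `pos`, `conj` and `num` are hypothetical, chosen only for illustration.

```python
from typing import Dict, List

# One analyzer entry: a mapping from feature name to feature value.
# The feature names used below are hypothetical placeholders.
Analysis = Dict[str, str]


def choose_analysis(candidates: List[Analysis], predicted: Dict[str, str]) -> Analysis:
    """Return the candidate that agrees with the most per-feature predictions.

    `candidates` stands in for the analyzer's output for one word;
    `predicted` stands in for the values chosen by the individual
    feature classifiers. Ties go to the earliest candidate.
    """
    if not candidates:
        raise ValueError("analyzer returned no candidates")

    def agreement(analysis: Analysis) -> int:
        # Count the features on which this entry matches the classifiers.
        return sum(1 for feat, val in predicted.items() if analysis.get(feat) == val)

    return max(candidates, key=agreement)


# Toy data, not taken from the paper.
candidates = [
    {"pos": "N", "conj": "no", "num": "sg"},
    {"pos": "V", "conj": "no", "num": "sg"},
    {"pos": "N", "conj": "yes", "num": "pl"},
]
predicted = {"pos": "N", "conj": "yes", "num": "pl"}
best = choose_analysis(candidates, predicted)
```

Because each analyzer entry fixes the tokenization, the part-of-speech tag and the remaining morphological features together, selecting a single entry settles all three at once. Other ways of combining the classifiers, such as weighting features differently, would slot into `agreement` without changing the rest of the sketch.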

Original language: English (US)
Title of host publication: ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Publisher: Association for Computational Linguistics (ACL)
Pages: 573-580
Number of pages: 8
ISBN (Print): 1932432515, 9781932432510
State: Published - 2005
Event: 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05 - Ann Arbor, MI, United States
Duration: Jun 25 2005 to Jun 30 2005

Publication series

Name: ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference

Other

Other: 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
Country: United States
City: Ann Arbor, MI
Period: 6/25/05 to 6/30/05

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
