Universal Dependencies for Arabic

Dima Taji, Nizar Habash, Daniel Zeman

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We describe the process of creating NUDAR, a Universal Dependency treebank for Arabic. We present the conversion from the Penn Arabic Treebank to the Universal Dependency syntactic representation through an intermediate dependency representation. We discuss the challenges faced in the conversion of the trees, the decisions we made to solve them, and the validation of our conversion. We also present initial parsing results on NUDAR.
Original languageUndefined
Title of host publicationProceedings of the Third Arabic Natural Language Processing Workshop
Place of PublicationValencia, Spain
PublisherAssociation for Computational Linguistics (ACL)
Pages166-176
Number of pages11
DOIs
StatePublished - Apr 1 2017

Cite this

Taji, D., Habash, N., & Zeman, D. (2017). Universal Dependencies for Arabic. In Proceedings of the Third Arabic Natural Language Processing Workshop (pp. 166-176). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/W17-1320