We describe the process of creating NUDAR, a Universal Dependency treebank for Arabic. We present the conversion from the Penn Arabic Treebank to the Universal Dependency syntactic representation through an intermediate dependency representation. We discuss the challenges faced in the conversion of the trees, the decisions we made to solve them, and the validation of our conversion. We also present initial parsing results on NUDAR.
|Title of host publication||Proceedings of the Third Arabic Natural Language Processing Workshop|
|Place of Publication||Valencia, Spain|
|Publisher||Association for Computational Linguistics (ACL)|
|Number of pages||11|
|State||Published - Apr 1 2017|