TY - GEN
T1 - Pos-tagging of tunisian dialect using standard arabic resources and tools
AU - Hamdi, Ahmed
AU - Nasr, Alexis
AU - Habash, Nizar
AU - Gala, Núria
N1 - Publisher Copyright:
© ACL 2015. All rights reserved.
PY - 2015
Y1 - 2015
N2 - Developing natural language processing tools usually requires a large number of resources (lexica, annotated corpora, etc.), which often do not exist for less-resourced languages. One way to overcome the problem of lack of resources is to devote substantial efforts to build new ones from scratch. Another approach is to exploit existing resources of closely related languages. In this paper, we focus on developing a part-of-speech tagger for the Tunisian Arabic dialect (TUN), a lowresource language, by exploiting its closeness to Modern Standard Arabic (MSA), which has many state-of-the-art resources and tools. Our system achieved an accuracy of 89% (∼20% absolute improvement over an MSA tagger baseline).
AB - Developing natural language processing tools usually requires a large number of resources (lexica, annotated corpora, etc.), which often do not exist for less-resourced languages. One way to overcome the problem of lack of resources is to devote substantial efforts to build new ones from scratch. Another approach is to exploit existing resources of closely related languages. In this paper, we focus on developing a part-of-speech tagger for the Tunisian Arabic dialect (TUN), a lowresource language, by exploiting its closeness to Modern Standard Arabic (MSA), which has many state-of-the-art resources and tools. Our system achieved an accuracy of 89% (∼20% absolute improvement over an MSA tagger baseline).
UR - http://www.scopus.com/inward/record.url?scp=84977508326&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84977508326&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84977508326
T3 - 2nd Workshop on Arabic Natural Language Processing, ANLP 2015 - held at 53rd Annual Meeting of the Association for Computational Linguistics, ACL 2015 - Proceedings
SP - 59
EP - 68
BT - 2nd Workshop on Arabic Natural Language Processing, ANLP 2015 - held at 53rd Annual Meeting of the Association for Computational Linguistics, ACL 2015 - Proceedings
A2 - Habash, Nizar
A2 - Vogel, Stephan
A2 - Darwish, Kareem
PB - Association for Computational Linguistics (ACL)
T2 - 2nd Workshop on Arabic Natural Language Processing, ANLP 2015
Y2 - 30 July 2015
ER -