TY - GEN
T1 - Automatic transliteration of romanized dialectal Arabic
AU - Al-Badrashiny, Mohamed
AU - Eskander, Ramy
AU - Habash, Nizar
AU - Rambow, Owen
PY - 2014/1/1
Y1 - 2014/1/1
N2 - In this paper, we address the problem of converting Dialectal Arabic (DA) text that is written in the Latin script (called Arabizi) into Arabic script following the CODA convention for DA orthography. The presented system uses a finite state transducer trained at the character level to generate all possible transliterations for the input Arabizi words. We then filter the generated list using a DA morphological analyzer. After that we pick the best choice for each input word using a language model. We achieve an accuracy of 69.4% on an unseen test set compared to 63.1% using a system which represents a previously proposed approach.
AB - In this paper, we address the problem of converting Dialectal Arabic (DA) text that is written in the Latin script (called Arabizi) into Arabic script following the CODA convention for DA orthography. The presented system uses a finite state transducer trained at the character level to generate all possible transliterations for the input Arabizi words. We then filter the generated list using a DA morphological analyzer. After that we pick the best choice for each input word using a language model. We achieve an accuracy of 69.4% on an unseen test set compared to 63.1% using a system which represents a previously proposed approach.
UR - http://www.scopus.com/inward/record.url?scp=84942564430&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84942564430&partnerID=8YFLogxK
M3 - Conference contribution
T3 - CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings
SP - 30
EP - 38
BT - CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 18th Conference on Computational Natural Language Learning, CoNLL 2014
Y2 - 26 June 2014 through 27 June 2014
ER -