TY - GEN
T1 - Feature Optimization for Predicting Readability of Arabic L1 and L2
AU - Saddiki, Hind
AU - Habash, Nizar
AU - Cavalli-Sforza, Violetta
AU - Khalil, Muhamed Al
N1 - Publisher Copyright:
© 2018 Association for Computational Linguistics.
PY - 2018
Y1 - 2018
N2 - Advances in automatic readability assessment can impact the way people consume information in a number of domains. Arabic, being a low-resource and morphologically complex language, presents numerous challenges to the task of automatic readability assessment. In this paper, we present the largest and most in-depth computational readability study for Arabic to date. We study a large set of features with varying depths, from shallow words to syntactic trees, for both L1 and L2 readability tasks. Our best L1 readability accuracy result is 94.8% (75% error reduction from a commonly used baseline). The comparable results for L2 are 72.4% (45% error reduction). We also demonstrate the added value of leveraging L1 features for L2 readability prediction.
AB - Advances in automatic readability assessment can impact the way people consume information in a number of domains. Arabic, being a low-resource and morphologically complex language, presents numerous challenges to the task of automatic readability assessment. In this paper, we present the largest and most in-depth computational readability study for Arabic to date. We study a large set of features with varying depths, from shallow words to syntactic trees, for both L1 and L2 readability tasks. Our best L1 readability accuracy result is 94.8% (75% error reduction from a commonly used baseline). The comparable results for L2 are 72.4% (45% error reduction). We also demonstrate the added value of leveraging L1 features for L2 readability prediction.
UR - http://www.scopus.com/inward/record.url?scp=85100256532&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100256532&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85100256532
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 20
EP - 29
BT - ACL 2018 - Natural Language Processing Techniques for Educational Applications, Proceedings of the 5th Workshop
PB - Association for Computational Linguistics (ACL)
T2 - ACL 2018 5th Workshop on Natural Language Processing Techniques for Educational Applications, NLPTEA 2018
Y2 - 19 July 2018
ER -