Feature Optimization for Predicting Readability of Arabic L1 and L2

Hind Saddiki, Nizar Habash, Violetta Cavalli-Sforza, Muhamed Al Khalil

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Advances in automatic readability assessment can impact the way people consume information in a number of domains. Arabic, being a low-resource and morphologically complex language, presents numerous challenges to the task of automatic readability assessment. In this paper, we present the largest and most in-depth computational readability study for Arabic to date. We study a large set of features with varying depths, from shallow words to syntactic trees, for both L1 and L2 readability tasks. Our best L1 readability accuracy result is 94.8% (75% error reduction from a commonly used baseline). The comparable results for L2 are 72.4% (45% error reduction). We also demonstrate the added value of leveraging L1 features for L2 readability prediction.

Original languageEnglish (US)
Title of host publicationACL 2018 - Natural Language Processing Techniques for Educational Applications, Proceedings of the 5th Workshop
PublisherAssociation for Computational Linguistics (ACL)
Pages20-29
Number of pages10
ISBN (Electronic)9781948087353
StatePublished - 2018
EventACL 2018 5th Workshop on Natural Language Processing Techniques for Educational Applications, NLPTEA 2018 - Melbourne, Australia
Duration: Jul 19 2018 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

ConferenceACL 2018 5th Workshop on Natural Language Processing Techniques for Educational Applications, NLPTEA 2018
Country/TerritoryAustralia
CityMelbourne
Period7/19/18 → …

ASJC Scopus subject areas

  • Computer Science Applications
  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'Feature Optimization for Predicting Readability of Arabic L1 and L2'. Together they form a unique fingerprint.

Cite this