TY - GEN
T1 - One-step statistical parsing of hybrid dependency-constituency syntactic representations
AU - Dukes, Kais
AU - Habash, Nizar
N1 - Publisher Copyright:
© 2011 Association for Computational Linguistics
PY - 2011
Y1 - 2011
N2 - In this paper, we describe and compare two statistical parsing approaches for the hybrid dependency-constituency syntactic representation used in the Quranic Arabic Treebank (Dukes and Buckwalter, 2010). In our first approach, we apply a multi-step process in which we use a shift-reduce algorithm trained on a pure dependency preprocessed version of the treebank. After parsing, the dependency output is converted into the hybrid representation. This is compared to a novel one-step parser that is able to learn the hybrid representation without preprocessing. We define an extended labelled attachment score (ELAS) as our performance metric for hybrid parsing, and report 87.47% (F1 score) for the multi-step approach, and 89.03% (F1 score) for the one-step integrated algorithm. We also consider the effect of using different sets of morphological features for parsing the Quran, comparing our results to recent work on Modern Standard Arabic.
AB - In this paper, we describe and compare two statistical parsing approaches for the hybrid dependency-constituency syntactic representation used in the Quranic Arabic Treebank (Dukes and Buckwalter, 2010). In our first approach, we apply a multi-step process in which we use a shift-reduce algorithm trained on a pure dependency preprocessed version of the treebank. After parsing, the dependency output is converted into the hybrid representation. This is compared to a novel one-step parser that is able to learn the hybrid representation without preprocessing. We define an extended labelled attachment score (ELAS) as our performance metric for hybrid parsing, and report 87.47% (F1 score) for the multi-step approach, and 89.03% (F1 score) for the one-step integrated algorithm. We also consider the effect of using different sets of morphological features for parsing the Quran, comparing our results to recent work on Modern Standard Arabic.
UR - http://www.scopus.com/inward/record.url?scp=85010190896&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85010190896&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85010190896
T3 - IWPT 2011 - Proceedings of the 12th International Conference on Parsing Technologies
SP - 92
EP - 103
BT - IWPT 2011 - Proceedings of the 12th International Conference on Parsing Technologies
PB - Association for Computational Linguistics (ACL)
T2 - 12th International Conference on Parsing Technologies, IWPT 2011
Y2 - 5 October 2011 through 7 October 2011
ER -