TY - JOUR
T1 - Estimating Causal Effects of Education Interventions Using a Two-Rating Regression Discontinuity Design
T2 - Lessons From a Simulation Study and an Application
AU - Porter, Kristin E.
AU - Reardon, Sean F.
AU - Unlu, Fatih
AU - Bloom, Howard S.
AU - Cimpian, Joseph R.
N1 - Publisher Copyright:
© 2017 Taylor & Francis Group, LLC.
PY - 2017/1/2
Y1 - 2017/1/2
N2 - A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the “surface” method, the “frontier” method, the “binding-score” method, and the “fuzzy instrumental variables” method. This article uses a series of simulations to evaluate the relative performance of each of these four methods under a variety of different data-generating models. Focusing on a two-rating RDD (2RRDD), we compare the methods in terms of their bias, precision, and mean squared error when implemented as they most likely would be in practice—using optimal bandwidth selection. We also apply the lessons learned from the simulations to a real-world example that uses data from a study of an English learner reclassification policy. Overall, this article makes valuable contributions to the literature on MRRDDs in that it makes concrete recommendations for choosing among MRRDD estimation methods, for implementing any chosen method using local linear regression, and for providing accurate statistical inferences.
AB - A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the “surface” method, the “frontier” method, the “binding-score” method, and the “fuzzy instrumental variables” method. This article uses a series of simulations to evaluate the relative performance of each of these four methods under a variety of different data-generating models. Focusing on a two-rating RDD (2RRDD), we compare the methods in terms of their bias, precision, and mean squared error when implemented as they most likely would be in practice—using optimal bandwidth selection. We also apply the lessons learned from the simulations to a real-world example that uses data from a study of an English learner reclassification policy. Overall, this article makes valuable contributions to the literature on MRRDDs in that it makes concrete recommendations for choosing among MRRDD estimation methods, for implementing any chosen method using local linear regression, and for providing accurate statistical inferences.
KW - RDD
KW - multiple ratings
KW - regression discontinuity design
UR - http://www.scopus.com/inward/record.url?scp=84992109511&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84992109511&partnerID=8YFLogxK
U2 - 10.1080/19345747.2016.1219436
DO - 10.1080/19345747.2016.1219436
M3 - Article
AN - SCOPUS:84992109511
SN - 1934-5747
VL - 10
SP - 138
EP - 167
JO - Journal of Research on Educational Effectiveness
JF - Journal of Research on Educational Effectiveness
IS - 1
ER -