TY - JOUR
T1 - Matching and semi-parametric IV estimation, a distance-based measure of migration, and the wages of young men
AU - Ham, John C.
AU - Li, Xianghong
AU - Reagan, Patricia B.
N1 - Funding Information:
An earlier version of this paper was presented under the title “Matching and Selection Estimates of the Effect of Migration on Wages for Young Men”. Geert Ridder, Barbara Sianesi and Barry Smith made numerous extremely helpful comments on the paper. We would also like to thank Dwayne Benjamin, Stephen Cosslett, Songnian Chen, Xiaohong Chen, William Greene, Cheng Hsiao, Guido Imbens, John Kennan, Lung-fei Lee, Audrey Light, Aloysius Siow, Jeffrey Smith, Petra Todd, Insan Tunali, Bruce Weinberg, Tiemen Woutersen, James Walker and Jeff Yankow for very helpful comments and discussions. We are also grateful to seminar participants at Arizona, CEMFI, the Federal Reserve Bank of New York, McGill, University of Montreal, McMaster, Minnesota (Industrial Relations), Ohio State, Pompeu Fabra, Rutgers, Toronto, UC Berkeley, UC Davis, UC San Diego, USC, Vanderbilt, Western Ontario, Wisconsin and the Upjohn Institute, as well as at the Econometric Society and Society of Labor Economics meetings. Finally we thank three anonymous referees and a Co-Editor for very helpful comments which led to a much improved paper. Eileen Kopchik and Yong Yu provided outstanding programming help. This research was partially supported by the NSF and NICHD . We emphasize that we are responsible for all errors. This paper reflects the views of the authors and in no way represents the views of the NSF or NICHD.
PY - 2011/4/1
Y1 - 2011/4/1
N2 - Our paper estimates the effect of US internal migration on wage growth for young men between their first and second job. Our analysis of migration extends previous research by: (i) exploiting the distance-based measures of migration in the National Longitudinal Surveys of Youth 1979 (NLSY79); (ii) allowing the effect of migration to differ by schooling level and (iii) using propensity score matching to estimate the average treatment effect on the treated (ATET) for movers and (iv) using local average treatment effect (LATE) estimators with covariates to estimate the average treatment effect (ATE) and ATET for compliers. We believe the Conditional Independence Assumption (CIA) is reasonable for our matching estimators since the NLSY79 provides a relatively rich array of variables on which to match. Our matching methods are based on local linear, local cubic, and local linear ridge regressions. Local linear and local ridge regression matching produce relatively similar point estimates and standard errors, while local cubic regression matching badly over-fits the data and provides very noisy estimates. We use the bootstrap to calculate standard errors. Since the validity of the bootstrap has not been investigated for the matching estimators we use, and has been shown to be invalid for nearest neighbor matching estimators, we conduct a Monte Carlo study on the appropriateness of using the bootstrap to calculate standard errors for local linear regression matching. The data generating processes in our Monte Carlo study are relatively rich and calibrated to match our empirical models or to test the sensitivity of our results to the choice of parameter values. The estimated standard errors from the bootstrap are very close to those from the Monte Carlo experiments, which lends support to our using the bootstrap to calculate standard errors in our setting. From the matching estimators we find a significant positive effect of migration on the wage growth of college graduates, and a marginally significant negative effect for high school dropouts. We do not find any significant effects for other educational groups or for the overall sample. Our results are generally robust to changes in the model specification and changes in our distance-based measure of migration. We find that better data matters; if we use a measure of migration based on moving across county lines, we overstate the number of moves, while if we use a measure based on moving across state lines, we understate the number of moves. Further, using either the county or state measures leads to much less precise estimates. We also consider semi-parametric LATE estimators with covariates (Frlich 2007), using two sets of instrumental variables. We precisely estimate the proportion of compliers in our data, but because we have a small number of compliers, we cannot obtain precise LATE estimates.
AB - Our paper estimates the effect of US internal migration on wage growth for young men between their first and second job. Our analysis of migration extends previous research by: (i) exploiting the distance-based measures of migration in the National Longitudinal Surveys of Youth 1979 (NLSY79); (ii) allowing the effect of migration to differ by schooling level and (iii) using propensity score matching to estimate the average treatment effect on the treated (ATET) for movers and (iv) using local average treatment effect (LATE) estimators with covariates to estimate the average treatment effect (ATE) and ATET for compliers. We believe the Conditional Independence Assumption (CIA) is reasonable for our matching estimators since the NLSY79 provides a relatively rich array of variables on which to match. Our matching methods are based on local linear, local cubic, and local linear ridge regressions. Local linear and local ridge regression matching produce relatively similar point estimates and standard errors, while local cubic regression matching badly over-fits the data and provides very noisy estimates. We use the bootstrap to calculate standard errors. Since the validity of the bootstrap has not been investigated for the matching estimators we use, and has been shown to be invalid for nearest neighbor matching estimators, we conduct a Monte Carlo study on the appropriateness of using the bootstrap to calculate standard errors for local linear regression matching. The data generating processes in our Monte Carlo study are relatively rich and calibrated to match our empirical models or to test the sensitivity of our results to the choice of parameter values. The estimated standard errors from the bootstrap are very close to those from the Monte Carlo experiments, which lends support to our using the bootstrap to calculate standard errors in our setting. From the matching estimators we find a significant positive effect of migration on the wage growth of college graduates, and a marginally significant negative effect for high school dropouts. We do not find any significant effects for other educational groups or for the overall sample. Our results are generally robust to changes in the model specification and changes in our distance-based measure of migration. We find that better data matters; if we use a measure of migration based on moving across county lines, we overstate the number of moves, while if we use a measure based on moving across state lines, we understate the number of moves. Further, using either the county or state measures leads to much less precise estimates. We also consider semi-parametric LATE estimators with covariates (Frlich 2007), using two sets of instrumental variables. We precisely estimate the proportion of compliers in our data, but because we have a small number of compliers, we cannot obtain precise LATE estimates.
KW - LATE
KW - Propensity score matching
KW - US internal migration
UR - http://www.scopus.com/inward/record.url?scp=79952453863&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952453863&partnerID=8YFLogxK
U2 - 10.1016/j.jeconom.2010.12.004
DO - 10.1016/j.jeconom.2010.12.004
M3 - Article
AN - SCOPUS:79952453863
SN - 0304-4076
VL - 161
SP - 208
EP - 227
JO - Journal of Econometrics
JF - Journal of Econometrics
IS - 2
ER -