TY - JOUR
T1 - SIS
T2 - An R package for sure independence screening in ultrahigh-dimensional statistical models
AU - Saldana, Diego Franco
AU - Feng, Yang
N1 - Publisher Copyright:
© 2018, American Statistical Association. All rights reserved.
PY - 2018
Y1 - 2018
N2 - We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.
AB - We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.
KW - Cox model
KW - Generalized linear models
KW - Penalized likelihood estimation
KW - Sparsity
KW - Sure independence screening
KW - Variable selection
UR - http://www.scopus.com/inward/record.url?scp=85042466528&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85042466528&partnerID=8YFLogxK
U2 - 10.18637/jss.v083.i02
DO - 10.18637/jss.v083.i02
M3 - Article
AN - SCOPUS:85042466528
SN - 1548-7660
VL - 83
JO - Journal of Statistical Software
JF - Journal of Statistical Software
ER -