SIS: An R package for sure independence screening in ultrahigh-dimensional statistical models

Diego Franco Saldana, Yang Feng

Research output: Contribution to journalArticlepeer-review

Abstract

We revisit sure independence screening procedures for variable selection in generalized linear models and the Cox proportional hazards model. Through the publicly available R package SIS, we provide a unified environment to carry out variable selection using iterative sure independence screening (ISIS) and all of its variants. For the regularization steps in the ISIS recruiting process, available penalties include the LASSO, SCAD, and MCP while the implemented variants for the screening steps are sample splitting, data-driven thresholding, and combinations thereof. Performance of these feature selection techniques is investigated by means of real and simulated data sets, where we find considerable improvements in terms of model selection and computational time between our algorithms and traditional penalized pseudo-likelihood methods applied directly to the full set of covariates.

Original languageEnglish (US)
JournalJournal of Statistical Software
Volume83
DOIs
StatePublished - 2018

Keywords

  • Cox model
  • Generalized linear models
  • Penalized likelihood estimation
  • Sparsity
  • Sure independence screening
  • Variable selection

ASJC Scopus subject areas

  • Software
  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint Dive into the research topics of 'SIS: An R package for sure independence screening in ultrahigh-dimensional statistical models'. Together they form a unique fingerprint.

Cite this