Modeling relationships between two categorical variables when data are missing: Examining consequences of the missing data mechanism in an HIV data set

Shiela M. Strauss, David M. Rindskopf, Gregory P. Falkin

Research output: Contribution to journalArticlepeer-review

Abstract

Analysts evaluating the strengths of relationships between variables in behavioral science research must often contend with the problem of missing data. Analyses are typically performed using data for cases that are either complete in all the variables, or assume that the data are missing at random. Often, these approaches yield biased results. Using empirical data, the current work explores the implications and consequences of using various statistical models to describe the association of two variables, one ordinal and one dichotomous, in which data are incomplete for the dichotomous variable. These models explicitly reflect the missing data mechanism; models that hypothesize nonignorable nonresponse are given particular attention. Both the statistical fit and substantive consequences of these models are examined. This new methodological approach to examining nonignorable nonresponse can be applied to many behavioral science data sets containing an ordinal variable.

Original languageEnglish (US)
Pages (from-to)471-500
Number of pages30
JournalMultivariate Behavioral Research
Volume36
Issue number4
DOIs
StatePublished - 2001

ASJC Scopus subject areas

  • Statistics and Probability
  • Experimental and Cognitive Psychology
  • Arts and Humanities (miscellaneous)

Fingerprint

Dive into the research topics of 'Modeling relationships between two categorical variables when data are missing: Examining consequences of the missing data mechanism in an HIV data set'. Together they form a unique fingerprint.

Cite this