TY - JOUR
T1 - Ubiquitous Bias and False Discovery Due to Model Misspecification in Analysis of Statistical Interactions
T2 - The Role of the Outcome’s Distribution and Metric Properties
AU - Domingue, Benjamin W.
AU - Kanopka, Klint
AU - Trejo, Sam
AU - Rhemtulla, Mijke
AU - Tucker-Drob, Elliot M.
N1 - Publisher Copyright:
© 2022 American Psychological Association
PY - 2022
Y1 - 2022
N2 - Studies of interaction effects are of great interest because they identify crucial interplay between predictors in explaining outcomes. Previous work has considered several potential sources of statistical bias and substantive misinterpretation in the study of interactions, but less attention has been devoted to the role of the outcome variable in such research. Here, we consider bias and false discovery associated with estimates of interaction parameters as a function of the distributional and metric properties of the outcome variable. We begin by illustrating that, for a variety of noncontinuously distributed outcomes (i.e., binary and count outcomes), attempts to use the linear model for recovery leads to catastrophic levels of bias and false discovery. Next, focusing on transformations of normally distributed variables (i.e., censoring and noninterval scaling), we show that linear models again produce spurious interaction effects. We provide explanations offering geometric and algebraic intuition as to why interactions are a challenge for these incorrectly specified models. In light of these findings, we make two specific recommendations. First, a careful consideration of the outcome’s distributional properties should be a standard component of interaction studies. Second, researchers should approach research focusing on interactions with heightened levels of scrutiny.
AB - Studies of interaction effects are of great interest because they identify crucial interplay between predictors in explaining outcomes. Previous work has considered several potential sources of statistical bias and substantive misinterpretation in the study of interactions, but less attention has been devoted to the role of the outcome variable in such research. Here, we consider bias and false discovery associated with estimates of interaction parameters as a function of the distributional and metric properties of the outcome variable. We begin by illustrating that, for a variety of noncontinuously distributed outcomes (i.e., binary and count outcomes), attempts to use the linear model for recovery leads to catastrophic levels of bias and false discovery. Next, focusing on transformations of normally distributed variables (i.e., censoring and noninterval scaling), we show that linear models again produce spurious interaction effects. We provide explanations offering geometric and algebraic intuition as to why interactions are a challenge for these incorrectly specified models. In light of these findings, we make two specific recommendations. First, a careful consideration of the outcome’s distributional properties should be a standard component of interaction studies. Second, researchers should approach research focusing on interactions with heightened levels of scrutiny.
KW - Bias
KW - Discovery
KW - False
KW - Interactions
KW - Misspecification
UR - http://www.scopus.com/inward/record.url?scp=85140728555&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140728555&partnerID=8YFLogxK
U2 - 10.1037/met0000532
DO - 10.1037/met0000532
M3 - Article
AN - SCOPUS:85140728555
SN - 1082-989X
JO - Psychological Methods
JF - Psychological Methods
ER -