TY - JOUR
T1 - Clustering of risk behaviors and classification performance in modeling adolescent risk
T2 - The example of the association between E-cigarette use and cigarette smoking
AU - Selya, Arielle
AU - Niaura, Raymond
AU - Kim, Sooyong
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/8/1
Y1 - 2025/8/1
N2 - Background: Adolescent behavioral risks are highly correlated, complicating interpretation of narrowly-focused research (e.g. 2–3 variables). We explore methodological issues when interpreting a narrowly-focused association (typically using a causal-inference framework) vs. a wider approach incorporating many correlated risk factors (using a less-common predictive-inference framework), using the currently-relevant example of adolescent e-cigarette use and cigarette smoking. Methods: Data were drawn from the Adolescent Behaviors and Experiences (ABES) study, national survey of U.S. youth, and behavioral risks were grouped into categories of e-cigarette use, cigarette smoking, other tobacco use, alcohol and cannabis use, illicit drug use, mental health, violence, other risky behaviors, and parental monitoring. Three exploratory data analyses (Spearman correlation, non-metric multidimensional scaling, and divisive hierarchical clustering) examined clustering/grouping across variables. Logistic regressions examined 1) the association between e-cigarette use and smoking and 2) the reverse-direction association, after successively adjusting for groups of risk factors. Ten-fold cross-validation was performed to evaluate predictive validity. Results: In three exploratory data analyses, e-cigarette use and cigarette smoking were correlated, but each was more closely related to other variables (alcohol and cannabis use vs. other tobacco use and illicit drugs, respectively). Logistic regression models showed similar odds ratios for the forward- and reverse-direction models, but cross-validation testing showed that the reverse-direction model had better classification performance. Conclusions: A narrow focus on adolescent risk behaviors with a causal-inference framework can result in erroneous interpretation in the presence of many correlated risk factors. A wider predictive-inference perspective can help inform better screening strategies and potential intervention targets.
AB - Background: Adolescent behavioral risks are highly correlated, complicating interpretation of narrowly-focused research (e.g. 2–3 variables). We explore methodological issues when interpreting a narrowly-focused association (typically using a causal-inference framework) vs. a wider approach incorporating many correlated risk factors (using a less-common predictive-inference framework), using the currently-relevant example of adolescent e-cigarette use and cigarette smoking. Methods: Data were drawn from the Adolescent Behaviors and Experiences (ABES) study, national survey of U.S. youth, and behavioral risks were grouped into categories of e-cigarette use, cigarette smoking, other tobacco use, alcohol and cannabis use, illicit drug use, mental health, violence, other risky behaviors, and parental monitoring. Three exploratory data analyses (Spearman correlation, non-metric multidimensional scaling, and divisive hierarchical clustering) examined clustering/grouping across variables. Logistic regressions examined 1) the association between e-cigarette use and smoking and 2) the reverse-direction association, after successively adjusting for groups of risk factors. Ten-fold cross-validation was performed to evaluate predictive validity. Results: In three exploratory data analyses, e-cigarette use and cigarette smoking were correlated, but each was more closely related to other variables (alcohol and cannabis use vs. other tobacco use and illicit drugs, respectively). Logistic regression models showed similar odds ratios for the forward- and reverse-direction models, but cross-validation testing showed that the reverse-direction model had better classification performance. Conclusions: A narrow focus on adolescent risk behaviors with a causal-inference framework can result in erroneous interpretation in the presence of many correlated risk factors. A wider predictive-inference perspective can help inform better screening strategies and potential intervention targets.
KW - Adolescents
KW - Cigarettes
KW - Electronic cigarettes
KW - Mental health
KW - Risk factors
KW - Risky behavior
KW - Substance use
UR - http://www.scopus.com/inward/record.url?scp=105006682716&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105006682716&partnerID=8YFLogxK
U2 - 10.1016/j.drugalcdep.2025.112716
DO - 10.1016/j.drugalcdep.2025.112716
M3 - Article
C2 - 40435593
AN - SCOPUS:105006682716
SN - 0376-8716
VL - 273
JO - Drug and alcohol dependence
JF - Drug and alcohol dependence
M1 - 112716
ER -