Social Determinants in Machine Learning Cardiovascular Disease Prediction Models: A Systematic Review

Yuan Zhao, Erica P. Wood, Nicholas Mirin, Stephanie H. Cook, Rumi Chunara

Research output: Contribution to journalReview articlepeer-review


Introduction: Cardiovascular disease is the leading cause of death worldwide, and cardiovascular disease burden is increasing in low-resource settings and for lower socioeconomic groups. Machine learning algorithms are being developed rapidly and incorporated into clinical practice for cardiovascular disease prediction and treatment decisions. Significant opportunities for reducing death and disability from cardiovascular disease worldwide lie with accounting for the social determinants of cardiovascular outcomes. This study reviews how social determinants of health are being included in machine learning algorithms to inform best practices for the development of algorithms that account for social determinants. Methods: A systematic review using 5 databases was conducted in 2020. English language articles from any location published from inception to April 10, 2020, which reported on the use of machine learning for cardiovascular disease prediction that incorporated social determinants of health, were included. Results: Most studies that compared machine learning algorithms and regression showed increased performance of machine learning, and most studies that compared performance with or without social determinants of health showed increased performance with them. The most frequently included social determinants of health variables were gender, race/ethnicity, marital status, occupation, and income. Studies were largely from North America, Europe, and China, limiting the diversity of the included populations and variance in social determinants of health. Discussion: Given their flexibility, machine learning approaches may provide an opportunity to incorporate the complex nature of social determinants of health. The limited variety of sources and data in the reviewed studies emphasize that there is an opportunity to include more social determinants of health variables, especially environmental ones, that are known to impact cardiovascular disease risk and that recording such data in electronic databases will enable their use.

Original languageEnglish (US)
Pages (from-to)596-605
Number of pages10
JournalAmerican journal of preventive medicine
Issue number4
StatePublished - Oct 2021

ASJC Scopus subject areas

  • Epidemiology
  • Public Health, Environmental and Occupational Health


Dive into the research topics of 'Social Determinants in Machine Learning Cardiovascular Disease Prediction Models: A Systematic Review'. Together they form a unique fingerprint.

Cite this