Machine Learning for Sociology

Mario Molina, Filiz Garip

Research output: Contribution to journalArticlepeer-review


Machine learning is a field at the intersection of statistics and computer science that uses algorithms to extract information and knowledge from data. Its applications increasingly find their way into economics, political science, and sociology. We offer a brief introduction to this vast toolbox and illustrate its current uses in the social sciences, including distilling measures from new data sources, such as text and images; characterizing population heterogeneity; improving causal inference; and offering predictions to aid policy decisions and theory development. We argue that, in addition to serving similar purposes in sociology, machine learning tools can speak to long-standing questions on the limitations of the linear modeling framework, the criteria for evaluating empirical findings, transparency around the context of discovery, and the epistemological core of the discipline.

Original languageEnglish (US)
Pages (from-to)27-45
Number of pages19
JournalAnnual Review of Sociology
StatePublished - Jul 30 2019


  • causal inference
  • discovery
  • heterogeneity
  • prediction
  • Supervised learning
  • unsupervised learning

ASJC Scopus subject areas

  • Sociology and Political Science


Dive into the research topics of 'Machine Learning for Sociology'. Together they form a unique fingerprint.

Cite this