TY - JOUR
T1 - Exploiting affinities between topic modeling and the sociological perspective on culture
T2 - Application to newspaper coverage of U.S. government arts funding
AU - DiMaggio, Paul
AU - Nag, Manish
AU - Blei, David
N1 - Funding Information:
Research support from Princeton University and sabbatical support for the first author from the Russell Sage Foundation are gratefully acknowledged, as is the assistance of Brian Steensland in gathering parts of the textual data. Support from the Rockefeller Foundation and the Andrew W. Mellon Foundation (through a grant to Princeton's Center for Arts and Cultural Policy Studies) for data collection, research and sabbatical support from Princeton University and the Russell Sage Foundation and research support from Princeton's Center for Information Technology and Public Policy are gratefully acknowledged. Thoughtful feedback from Amy Binder, Clayton Childress, Edward Hunter, the editor, special issue editor, and reviewers for Poetics, and participants in the University of Pennsylvania's Economic Sociology Colloquium and Princeton University's Theorodology Workshop is gratefully acknowledged.
PY - 2013/12
Y1 - 2013/12
N2 - Topic modeling provides a valuable method for identifying the linguistic contexts that surround social institutions or policy domains. This article uses Latent Dirichlet Allocation (LDA) to analyze how one such policy domain, government assistance to artists and arts organizations, was framed in almost 8000 articles. These comprised all articles that referred to government support for the arts in the U.S. published in five U.S. newspapers between 1986 and 1997-a period during which such assistance, once noncontroversial, became a focus of contention. We illustrate the strengths of topic modeling as a means of analyzing large text corpora, discuss the proper choice of models and interpretation of model results, describe means of validating topic-model solutions, and demonstrate the use of topic models in combination with other statistical tools to estimate differences between newspapers in the prevalence of different frames. Throughout, we emphasize affinities between the topic-modeling approach and such central concepts in the study of culture as framing, polysemy, heteroglossia, and the relationality of meaning.
AB - Topic modeling provides a valuable method for identifying the linguistic contexts that surround social institutions or policy domains. This article uses Latent Dirichlet Allocation (LDA) to analyze how one such policy domain, government assistance to artists and arts organizations, was framed in almost 8000 articles. These comprised all articles that referred to government support for the arts in the U.S. published in five U.S. newspapers between 1986 and 1997-a period during which such assistance, once noncontroversial, became a focus of contention. We illustrate the strengths of topic modeling as a means of analyzing large text corpora, discuss the proper choice of models and interpretation of model results, describe means of validating topic-model solutions, and demonstrate the use of topic models in combination with other statistical tools to estimate differences between newspapers in the prevalence of different frames. Throughout, we emphasize affinities between the topic-modeling approach and such central concepts in the study of culture as framing, polysemy, heteroglossia, and the relationality of meaning.
KW - Content analysis
KW - Heteroglossia
KW - Meaning
KW - National Endowment for the Arts
KW - Polysemy
KW - Topic models
UR - http://www.scopus.com/inward/record.url?scp=84888133421&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84888133421&partnerID=8YFLogxK
U2 - 10.1016/j.poetic.2013.08.004
DO - 10.1016/j.poetic.2013.08.004
M3 - Article
AN - SCOPUS:84888133421
SN - 0304-422X
VL - 41
SP - 570
EP - 606
JO - Poetics
JF - Poetics
IS - 6
ER -