Using natural language processing to analyse text data in behavioural science

Stefan Feuerriegel, Abdurahman Maarouf, Dominik Bär, Dominique Geissler, Jonas Schweisthal, Nicolas Pröllochs, Claire E. Robertson, Steve Rathje, Jochen Hartmann, Saif M. Mohammad, Oded Netzer, Alexandra A. Siegel, Barbara Plank, Jay J. Van Bavel

Research output: Contribution to journalReview articlepeer-review

Abstract

Language is a uniquely human trait at the core of human interactions. The language people use often reflects their personality, intentions and state of mind. With the integration of the Internet and social media into everyday life, much of human communication is documented as written text. These online forms of communication (for example, blogs, reviews, social media posts and emails) provide a window into human behaviour and therefore present abundant research opportunities for behavioural science. In this Review, we describe how natural language processing (NLP) can be used to analyse text data in behavioural science. First, we review applications of text data in behavioural science. Second, we describe the NLP pipeline and explain the underlying modelling approaches (for example, dictionary-based approaches and large language models). We discuss the advantages and disadvantages of these methods for behavioural science, in particular with respect to the trade-off between interpretability and accuracy. Finally, we provide actionable recommendations for using NLP to ensure rigour and reproducibility.

Original languageEnglish (US)
Article numbere2024292118
Pages (from-to)96-111
JournalNature Reviews Psychology
Volume4
Issue number2
DOIs
StatePublished - Feb 2025

ASJC Scopus subject areas

  • Psychology (miscellaneous)
  • Clinical Psychology
  • Developmental and Educational Psychology

Fingerprint

Dive into the research topics of 'Using natural language processing to analyse text data in behavioural science'. Together they form a unique fingerprint.

Cite this