Generative AI Meets Open-Ended Survey Responses: Research Participant Use of AI and Homogenization

Simone Zhang, Janet Xu, A. J. Alvero

    Research output: Contribution to journalArticlepeer-review

    Abstract

    The growing popularity of generative artificial intelligence (AI) tools presents new challenges for data quality in online surveys and experiments. This study examines participants’ use of large language models to answer open-ended survey questions and describes empirical tendencies in human versus large language model (LLM)-generated text responses. In an original survey of research participants recruited from a popular online platform for sourcing social science research subjects, 34 percent reported using LLMs to help them answer open-ended survey questions. Simulations comparing human-written responses from three pre-ChatGPT studies with LLM-generated text reveal that LLM responses are more homogeneous and positive, particularly when they describe social groups in sensitive questions. These homogenization patterns may mask important underlying social variation in attitudes and beliefs among human subjects, raising concerns about data validity. Our findings shed light on the scope and potential consequences of participants’ LLM use in online research.

    Original languageEnglish (US)
    Article number00491241251327130
    JournalSociological Methods and Research
    DOIs
    StateAccepted/In press - 2025

    Keywords

    • data quality
    • generative AI
    • human-AI interaction
    • large language models
    • surveys

    ASJC Scopus subject areas

    • Social Sciences (miscellaneous)
    • Sociology and Political Science

    Fingerprint

    Dive into the research topics of 'Generative AI Meets Open-Ended Survey Responses: Research Participant Use of AI and Homogenization'. Together they form a unique fingerprint.

    Cite this