TY - JOUR
T1 - Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings
AU - An, Ruopeng
AU - Batcheller, Quinlan
AU - Wang, Junjie
AU - Yang, Yuyi
N1 - Publisher Copyright:
© 2023 Ruopeng An et al., published by Sciendo.
PY - 2023/6
Y1 - 2023/6
N2 - Purpose: Media exaggerations of health research may confuse readers' understanding, erode public trust in science and medicine, and cause disease mismanagement. This study built artificial intelligence (AI) models to automatically identify and correct news headlines exaggerating obesity-related research findings. Design/methodology/approach: We searched popular digital media outlets to collect 523 headlines exaggerating obesity-related research findings. The reasons for exaggerations include: inferring causality from observational studies, inferring human outcomes from animal research, inferring distant/end outcomes (e.g., obesity) from immediate/intermediate outcomes (e.g., calorie intake), and generalizing findings to the population from a subgroup or convenience sample. Each headline was paired with the title and abstract of the peer-reviewed journal publication covered by the news article. We drafted an exaggeration-free counterpart for each original headline and fined-Tuned a BERT model to differentiate between them. We further fine-Tuned three generative language models-BART, PEGASUS, and T5 to autogenerate exaggeration-free headlines based on a journal publication's title and abstract. Model performance was evaluated using the ROUGE metrics by comparing model-generated headlines with journal publication titles. Findings: The fine-Tuned BERT model achieved 92.5% accuracy in differentiating between exaggeration-free and original headlines. Baseline ROUGE scores averaged 0.311 for ROUGE-1, 0.113 for ROUGE-2, 0.253 for ROUGE-L, and 0.253 ROUGE-Lsum. PEGASUS, T5, and BART all outperformed the baseline. The best-performing BART model attained 0.447 for ROUGE-1, 0.221 for ROUGE-2, 0.402 for ROUGE-L, and 0.402 for ROUGE-Lsum. Originality/value: This study demonstrated the feasibility of leveraging AI to automatically identify and correct news headlines exaggerating obesity-related research findings.
AB - Purpose: Media exaggerations of health research may confuse readers' understanding, erode public trust in science and medicine, and cause disease mismanagement. This study built artificial intelligence (AI) models to automatically identify and correct news headlines exaggerating obesity-related research findings. Design/methodology/approach: We searched popular digital media outlets to collect 523 headlines exaggerating obesity-related research findings. The reasons for exaggerations include: inferring causality from observational studies, inferring human outcomes from animal research, inferring distant/end outcomes (e.g., obesity) from immediate/intermediate outcomes (e.g., calorie intake), and generalizing findings to the population from a subgroup or convenience sample. Each headline was paired with the title and abstract of the peer-reviewed journal publication covered by the news article. We drafted an exaggeration-free counterpart for each original headline and fined-Tuned a BERT model to differentiate between them. We further fine-Tuned three generative language models-BART, PEGASUS, and T5 to autogenerate exaggeration-free headlines based on a journal publication's title and abstract. Model performance was evaluated using the ROUGE metrics by comparing model-generated headlines with journal publication titles. Findings: The fine-Tuned BERT model achieved 92.5% accuracy in differentiating between exaggeration-free and original headlines. Baseline ROUGE scores averaged 0.311 for ROUGE-1, 0.113 for ROUGE-2, 0.253 for ROUGE-L, and 0.253 ROUGE-Lsum. PEGASUS, T5, and BART all outperformed the baseline. The best-performing BART model attained 0.447 for ROUGE-1, 0.221 for ROUGE-2, 0.402 for ROUGE-L, and 0.402 for ROUGE-Lsum. Originality/value: This study demonstrated the feasibility of leveraging AI to automatically identify and correct news headlines exaggerating obesity-related research findings.
KW - Artificial intelligence
KW - Deep neural networks
KW - Exaggeration
KW - Headlines
KW - News
KW - Obesity
UR - http://www.scopus.com/inward/record.url?scp=85163384245&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163384245&partnerID=8YFLogxK
U2 - 10.2478/jdis-2023-0014
DO - 10.2478/jdis-2023-0014
M3 - Article
AN - SCOPUS:85163384245
SN - 2096-157X
JO - Journal of Data and Information Science
JF - Journal of Data and Information Science
ER -