Quality and Accountability of ChatGPT in Health Care in Low- and Middle-Income Countries: Simulated Patient Study

Yafei Si, Yuyi Yang, Xi Wang, Jiaqi Zu, Xi Chen, Xiaojing Fan, Ruopeng An, Sen Gong

Research output: Contribution to journalArticlepeer-review

Abstract

Using simulated patients to mimic 9 established noncommunicable and infectious diseases, we assessed ChatGPT’s performance in treatment recommendations for common diseases in low- and middle-income countries. ChatGPT had a high level of accuracy in both correct diagnoses (20/27, 74%) and medication prescriptions (22/27, 82%) but a concerning level of unnecessary or harmful medications (23/27, 85%) even with correct diagnoses. ChatGPT performed better in managing noncommunicable diseases than infectious ones. These results highlight the need for cautious AI integration in health care systems to ensure quality and safety.

Original languageEnglish (US)
Article numbere56121
JournalJournal of medical Internet research
Volume26
DOIs
StatePublished - 2024

Keywords

  • AI
  • AI integration
  • ChatGPT
  • LMIC
  • artificial intelligence
  • effectiveness
  • generative AI
  • health care
  • low- and middle-income countries
  • medication prescription
  • noncommunicable diseases
  • patient study
  • prescription
  • quality
  • quality and safety
  • reliability
  • simulated patient

ASJC Scopus subject areas

  • Health Informatics

Fingerprint

Dive into the research topics of 'Quality and Accountability of ChatGPT in Health Care in Low- and Middle-Income Countries: Simulated Patient Study'. Together they form a unique fingerprint.

Cite this