TY - JOUR
T1 - The emergence of economic rationality of GPT
AU - Chen, Yiting
AU - Liu, Tracy Xiao
AU - Shan, You
AU - Zhong, Songfa
N1 - Publisher Copyright:
© 2023 the Author(s).
PY - 2023
Y1 - 2023
N2 - As large language models (LLMs) like GPT become increasingly prevalent, it is essential that we assess their capabilities beyond language processing. This paper examines the economic rationality of GPT by instructing it to make budgetary decisions in four domains: risk, time, social, and food preferences. We measure economic rationality by assessing the consistency of GPT's decisions with utility maximization in classic revealed preference theory. We find that GPT's decisions are largely rational in each domain and demonstrate higher rationality score than those of human subjects in a parallel experiment and in the literature. Moreover, the estimated preference parameters of GPT are slightly different from human subjects and exhibit a lower degree of heterogeneity. We also find that the rationality scores are robust to the degree of randomness and demographic settings such as age and gender but are sensitive to contexts based on the language frames of the choice situations. These results suggest the potential of LLMs to make good decisions and the need to further understand their capabilities, limitations, and underlying mechanisms.
AB - As large language models (LLMs) like GPT become increasingly prevalent, it is essential that we assess their capabilities beyond language processing. This paper examines the economic rationality of GPT by instructing it to make budgetary decisions in four domains: risk, time, social, and food preferences. We measure economic rationality by assessing the consistency of GPT's decisions with utility maximization in classic revealed preference theory. We find that GPT's decisions are largely rational in each domain and demonstrate higher rationality score than those of human subjects in a parallel experiment and in the literature. Moreover, the estimated preference parameters of GPT are slightly different from human subjects and exhibit a lower degree of heterogeneity. We also find that the rationality scores are robust to the degree of randomness and demographic settings such as age and gender but are sensitive to contexts based on the language frames of the choice situations. These results suggest the potential of LLMs to make good decisions and the need to further understand their capabilities, limitations, and underlying mechanisms.
KW - decision-making
KW - economic rationality
KW - large language models
KW - revealed preference analysis
UR - http://www.scopus.com/inward/record.url?scp=85179643931&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85179643931&partnerID=8YFLogxK
U2 - 10.1073/pnas.2316205120
DO - 10.1073/pnas.2316205120
M3 - Article
C2 - 38085780
AN - SCOPUS:85179643931
SN - 0027-8424
VL - 120
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 51
M1 - e2316205120
ER -