TY - JOUR
T1 - Brain mechanism of foraging
T2 - Reward-dependent synaptic plasticity versus neural integration of values
AU - Pereira-Obilinovic, Ulises
AU - Hou, Han
AU - Svoboda, Karel
AU - Wang, Xiao Jing
N1 - Publisher Copyright:
Copyright © 2024 the Author(s). Published by PNAS. This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
PY - 2024/4/2
Y1 - 2024/4/2
N2 - During foraging behavior, action values are persistently encoded in neural activity and updated depending on the history of choice outcomes. What is the neural mechanism for action value maintenance and updating? Here, we explore two contrasting network models: synaptic learning of action value versus neural integration. We show that both models can reproduce extant experimental data, but they yield distinct predictions about the underlying biological neural circuits. In particular, the neural integrator model but not the synaptic model requires that reward signals are mediated by neural pools selective for action alternatives and their projections are aligned with linear attractor axes in the valuation system. We demonstrate experimentally observable neural dynamical signatures and feasible perturbations to differentiate the two contrasting scenarios, suggesting that the synaptic model is a more robust candidate mechanism. Overall, this work provides a modeling framework to guide future experimental research on probabilistic foraging.
AB - During foraging behavior, action values are persistently encoded in neural activity and updated depending on the history of choice outcomes. What is the neural mechanism for action value maintenance and updating? Here, we explore two contrasting network models: synaptic learning of action value versus neural integration. We show that both models can reproduce extant experimental data, but they yield distinct predictions about the underlying biological neural circuits. In particular, the neural integrator model but not the synaptic model requires that reward signals are mediated by neural pools selective for action alternatives and their projections are aligned with linear attractor axes in the valuation system. We demonstrate experimentally observable neural dynamical signatures and feasible perturbations to differentiate the two contrasting scenarios, suggesting that the synaptic model is a more robust candidate mechanism. Overall, this work provides a modeling framework to guide future experimental research on probabilistic foraging.
KW - foraging
KW - neural integrator
KW - reinforcement learning
KW - value-based decision-making
UR - http://www.scopus.com/inward/record.url?scp=85189720081&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85189720081&partnerID=8YFLogxK
U2 - 10.1073/pnas.2318521121
DO - 10.1073/pnas.2318521121
M3 - Article
C2 - 38551832
AN - SCOPUS:85189720081
SN - 0027-8424
VL - 121
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 14
M1 - e2318521121
ER -