TY - JOUR
T1 - A recurrent network model of planning explains hippocampal replay and human behavior
AU - Jensen, Kristopher T.
AU - Hennequin, Guillaume
AU - Mattar, Marcelo G.
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/7
Y1 - 2024/7
N2 - When faced with a novel situation, people often spend substantial periods of time contemplating possible futures. For such planning to be rational, the benefits to behavior must compensate for the time spent thinking. Here, we capture these features of behavior by developing a neural network model where planning itself is controlled by the prefrontal cortex. This model consists of a meta-reinforcement learning agent augmented with the ability to plan by sampling imagined action sequences from its own policy, which we call ‘rollouts’. In a spatial navigation task, the agent learns to plan when it is beneficial, which provides a normative explanation for empirical variability in human thinking times. Additionally, the patterns of policy rollouts used by the artificial agent closely resemble patterns of rodent hippocampal replays. Our work provides a theory of how the brain could implement planning through prefrontal–hippocampal interactions, where hippocampal replays are triggered by—and adaptively affect—prefrontal dynamics.
AB - When faced with a novel situation, people often spend substantial periods of time contemplating possible futures. For such planning to be rational, the benefits to behavior must compensate for the time spent thinking. Here, we capture these features of behavior by developing a neural network model where planning itself is controlled by the prefrontal cortex. This model consists of a meta-reinforcement learning agent augmented with the ability to plan by sampling imagined action sequences from its own policy, which we call ‘rollouts’. In a spatial navigation task, the agent learns to plan when it is beneficial, which provides a normative explanation for empirical variability in human thinking times. Additionally, the patterns of policy rollouts used by the artificial agent closely resemble patterns of rodent hippocampal replays. Our work provides a theory of how the brain could implement planning through prefrontal–hippocampal interactions, where hippocampal replays are triggered by—and adaptively affect—prefrontal dynamics.
UR - http://www.scopus.com/inward/record.url?scp=85195375876&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85195375876&partnerID=8YFLogxK
U2 - 10.1038/s41593-024-01675-7
DO - 10.1038/s41593-024-01675-7
M3 - Article
C2 - 38849521
AN - SCOPUS:85195375876
SN - 1097-6256
VL - 27
SP - 1340
EP - 1348
JO - Nature Neuroscience
JF - Nature Neuroscience
IS - 7
ER -