TY - JOUR
T1 - The ubiquity of model-based reinforcement learning
AU - Doll, Bradley B.
AU - Simon, Dylan A.
AU - Daw, Nathaniel D.
N1 - Publisher Copyright:
© 2012 Elsevier Ltd.
PY - 2012/12/1
Y1 - 2012/12/1
N2 - The reward prediction error (RPE) theory of dopamine (DA) function has enjoyed great success in the neuroscience of learning and decision-making. This theory is derived from model-free reinforcement learning (RL), in which choices are made simply on the basis of previously realized rewards. Recently, attention has turned to correlates of more flexible, albeit computationally complex, model-based methods in the brain. These methods are distinguished from model-free learning by their evaluation of candidate actions using expected future outcomes according to a world model. Puzzlingly, signatures from these computations seem to be pervasive in the very same regions previously thought to support model-free learning. Here, we review recent behavioral and neural evidence about these two systems, in attempt to reconcile their enigmatic cohabitation in the brain.
AB - The reward prediction error (RPE) theory of dopamine (DA) function has enjoyed great success in the neuroscience of learning and decision-making. This theory is derived from model-free reinforcement learning (RL), in which choices are made simply on the basis of previously realized rewards. Recently, attention has turned to correlates of more flexible, albeit computationally complex, model-based methods in the brain. These methods are distinguished from model-free learning by their evaluation of candidate actions using expected future outcomes according to a world model. Puzzlingly, signatures from these computations seem to be pervasive in the very same regions previously thought to support model-free learning. Here, we review recent behavioral and neural evidence about these two systems, in attempt to reconcile their enigmatic cohabitation in the brain.
UR - http://www.scopus.com/inward/record.url?scp=84872761547&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84872761547&partnerID=8YFLogxK
U2 - 10.1016/j.conb.2012.08.003
DO - 10.1016/j.conb.2012.08.003
M3 - Review article
C2 - 22959354
AN - SCOPUS:84872761547
SN - 0959-4388
VL - 22
SP - 1075
EP - 1081
JO - Current Opinion in Neurobiology
JF - Current Opinion in Neurobiology
IS - 6
ER -