Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning

Nathaniel D. Daw, Philipe N. Tobler

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

In order to choose advantageously in many circumstances, the values of choice alternatives have to be learned from experience. We provide an introduction to theoretical and experimental work on reinforcement learning, that is, trial-and-error learning to obtain rewards or avoid punishments. We introduce one version, the temporal-difference learning model, and review evidence that its predictions relate to the firing properties of midbrain dopamine neurons and to activity recorded with functional neuroimaging in humans. We also present evidence that this computational and neurophysiological mechanism affects human and animal behavior in decision and conditioning tasks.

Original languageEnglish (US)
Title of host publicationNeuroeconomics
Subtitle of host publicationDecision Making and the Brain: Second Edition
PublisherElsevier Inc.
Pages283-298
Number of pages16
ISBN (Print)9780124160088
DOIs
StatePublished - Sep 2013

Keywords

  • Dopamine
  • Reinforcement learning

ASJC Scopus subject areas

  • Neuroscience(all)

Fingerprint Dive into the research topics of 'Value Learning through Reinforcement: The Basics of Dopamine and Reinforcement Learning'. Together they form a unique fingerprint.

Cite this