Temporal difference models and reward-related learning in the human brain.

Temporal difference learning has been proposed as a model for Pavlovian conditioning, in which an animal learns to predict delivery of reward following presentation of a conditioned stimulus (CS). A key component of this model is a prediction error signal, which, before learning, responds at the time of presentation of reward but, after learning, shifts its response to the time of onset of the CS. In order to test for regions manifesting this signal profile, subjects were scanned using event-related fMRI while undergoing appetitive conditioning with a pleasant taste reward. Regression analyses revealed that responses in ventral striatum and orbitofrontal cortex were significantly correlated with this error signal, suggesting that, during appetitive conditioning, computations described by temporal difference learning are expressed in the human brain.
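The shift of the prediction error from the time of reward to the time of CS onset can be illustrated with a small simulation. The sketch below is not the analysis from the study; it is a minimal temporal difference model under assumptions made only for illustration: a trial divided into discrete time steps, one learned weight per time step between CS onset and reward, a pre-CS value fixed at zero because CS onset is unpredictable, and hypothetical parameter values (`alpha`, `gamma`, `t_cs`, `t_reward`).

```python
def simulate_td(n_trials=200, n_steps=10, t_cs=2, t_reward=7,
                alpha=0.3, gamma=1.0):
    """Return, for each trial, the list of prediction errors delta[t].

    All parameter values are illustrative assumptions, not taken from the study.
    """
    # One weight per time step elapsed since CS onset, up to (not including)
    # the reward step. Each weight is the learned value of that moment.
    w = [0.0] * (t_reward - t_cs)

    def value(t):
        # Predicted future reward at time t; zero outside the CS-to-reward
        # interval (before the CS nothing predicts reward).
        return w[t - t_cs] if t_cs <= t < t_reward else 0.0

    history = []
    for _ in range(n_trials):
        deltas = [0.0] * n_steps
        for t in range(1, n_steps):
            r = 1.0 if t == t_reward else 0.0
            # TD error: reward now, plus discounted prediction now,
            # minus the prediction made one step earlier.
            delta = r + gamma * value(t) - value(t - 1)
            deltas[t] = delta
            # Credit the error to the state occupied at t - 1, if it is
            # one of the learned CS-to-reward states.
            if t_cs <= t - 1 < t_reward:
                w[t - 1 - t_cs] += alpha * delta
        history.append(deltas)
    return history


if __name__ == "__main__":
    history = simulate_td()
    for label, trial in (("first", history[0]), ("last", history[-1])):
        print(label, "trial: error at CS =", round(trial[2], 3),
              "| error at reward =", round(trial[7], 3))
```

With these settings the error on the first trial occurs at the reward step, and by the last trial it has moved to the CS onset step while the error at reward has shrunk toward zero, which is the signal profile the abstract describes.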

PubMed ID: 12718865


  • O'Doherty JP
  • Dayan P
  • Friston K
  • Critchley H
  • Dolan RJ



Publication Data

April 24, 2003

MeSH Terms

  • Adolescent
  • Adult
  • Brain
  • Brain Mapping
  • Conditioning, Classical
  • Corpus Striatum
  • Female
  • Frontal Lobe
  • Humans
  • Learning
  • Magnetic Resonance Imaging
  • Male
  • Reference Values
  • Reflex, Pupillary
  • Reward
  • Taste
  • Time Perception