Correlates of reward-predictive value in
โ
Murat Okatan
๐
Article
๐
2009
๐
John Wiley and Sons
๐
English
โ 675 KB
## Abstract Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here