Reward prediction error neurons implement an efficient code for reward

Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).

Article CAS PubMed Google Scholar

Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction (MathWorks, 2018).

Balleine, B. W., Daw, N. D. & O’Doherty, J. P. in Neuroeconomics (eds Glimcher, P. W. et al.) 367–387 (Academic Press, 2009).

Attneave, F. Some informational aspects of visual perception. Psychol. Rev. 61, 183–193 (1954).

Article CAS PubMed Google Scholar

Barlow, H. B. in Sensory Communication (ed Rosenblith, W. A.) 216–234 (MIT Press, 1961).

Laughlin, S. A simple coding procedure enhances a neuron’s information capacity. Z. Naturforsch. C Biosci. 36, 910–912 (1981).

Article CAS PubMed Google Scholar

Schwartz, O. & Simoncelli, E. P. Natural signal statistics and sensory gain control. Nat. Neurosci. 4, 819–825 (2001).

Article CAS PubMed Google Scholar

Wei, X.-X. & Stocker, A. A. Lawful relation between perceptual bias and discriminability. Proc. Natl Acad. Sci. USA 114, 10244–10249 (2017).

Article CAS PubMed PubMed Central Google Scholar

Louie, K., Glimcher, P. W. & Webb, R. Adaptive neural coding: from biological to behavioral decision-making. Curr. Opin. Behav. Sci. 5, 91–99 (2015).

Article PubMed PubMed Central Google Scholar

Polanía, R., Woodford, M. & Ruff, C. C. Efficient coding of subjective value. Nat. Neurosci. 22, 134–142 (2019).

Article PubMed Google Scholar

Bhui, R., Lai, L. & Gershman, S. J. Resource-rational decision making. Curr. Opin. Behav. Sci. 41, 15–21 (2021).

Article Google Scholar

Louie, K. & Glimcher, P. W. Efficient coding and the neural representation of value. Ann. N Y Acad. Sci. 1251, 13–32 (2012).

Article PubMed Google Scholar

Motiwala, A., Soares, S., Atallah, B. V., Paton, J. J. & Machens, C. K. Efficient coding of cognitive variables underlies dopamine response and choice behavior. Nat. Neurosci. 25, 738–748 (2022).

Article CAS PubMed Google Scholar

Eshel, N. et al. Arithmetic and local circuitry underlying dopamine prediction errors. Nature 525, 243–246 (2015).

Article CAS PubMed PubMed Central Google Scholar

Eshel, N., Tian, J., Bukwich, M. & Uchida, N. Dopamine neurons share common response function for reward prediction error. Nat. Neurosci. 19, 479–486 (2016).

Article CAS PubMed PubMed Central Google Scholar

Dabney, W. et al. A distributional code for value in dopamine-based reinforcement learning. Nature 577, 671–675 (2020).

Article CAS PubMed PubMed Central Google Scholar

Rothenhoefer, K. M., Hong, T., Alikaya, A. & Stauffer, W. R. Rare rewards amplify dopamine responses. Nat. Neurosci. 24, 465–469 (2021).

Article CAS PubMed PubMed Central Google Scholar

Ganguli, D. & Simoncelli, E. P. Efficient sensory encoding and Bayesian inference with heterogeneous neural populations. Neural Comput. 26, 2103–2134 (2014).

Article PubMed PubMed Central Google Scholar

Fiorillo, C. D., Tobler, P. N. & Schultz, W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902 (2003).

Article CAS PubMed Google Scholar

Cohen, J. D. & Servan-Schreiber, D. A theory of dopamine function and its role in cognitive deficits in schizophrenia. Schizophr. Bull. 19, 85–104 (1993).

Article CAS PubMed Google Scholar

Wei, X.-X. & Stocker, A. A. Bayesian inference with efficient neural population codes. In Artificial Neural Networks and Machine Learning—ICANN 2012, Vol. 7552 (eds Hutchison, D. et al.) 523–530 (Springer, 2012).

Frank, M. J., Seeberger, L. C. & O’Reilly, R. C. By carrot or by stick: cognitive reinforcement learning in Parkinsonism. Science 306, 1940–1943 (2004).

Article CAS PubMed Google Scholar

Mikhael, J. G. & Bogacz, R. Learning reward uncertainty in the basal ganglia. PLoS Comput. Biol. 12, e1005062 (2016).

Article PubMed PubMed Central Google Scholar

Kobayashi, S. & Schultz, W. Influence of reward delays on responses of dopamine neurons. J. Neurosci. 28, 7837–7846 (2008).

Article CAS PubMed PubMed Central Google Scholar

Roesch, M. R., Calu, D. J. & Schoenbaum, G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat. Neurosci. 10, 1615–1624 (2007).

Article CAS PubMed PubMed Central Google Scholar

Kim, H. R. et al. A unified framework for dopamine signals across timescales. Cell 183, 1600–1616 (2020).

Article Google Scholar

Starkweather, C. K. & Uchida, N. Dopamine signals as temporal difference errors: recent advances. Curr. Opin. Neurobiol. 67, 95–105 (2021).

Article CAS PubMed Google Scholar

Starkweather, C. K., Babayan, B. M., Uchida, N. & Gershman, S. J. Dopamine reward prediction errors reflect hidden-state inference across time. Nat. Neurosci. 20, 581–589 (2017).

Article CAS PubMed PubMed Central Google Scholar

Soares, S., Atallah, B. V. & Paton, J. J. Midbrain dopamine neurons control judgment of time. Science 354, 1273–1277 (2016).

Article CAS PubMed Google Scholar

Tano, P., Dayan, P. & Pouget, A. A local temporal difference code for distributional reinforcement learning. In Advances in Neural Information Processing Systems 33 (eds Larochelle, H. et al.) 13662–13673 (Neural Information Processing Systems Foundation, 2020).

Louie, K. Asymmetric and adaptive reward coding via normalized reinforcement learning. PLoS Comput. Biol. 18, e1010350 (2022).

Article CAS PubMed PubMed Central Google Scholar

Naka, K. I. & Rushton, W. A. H. An attempt to analyse colour reception by electrophysiology. J. Physiol. 185, 556–586 (1966).

Article CAS PubMed PubMed Central Google Scholar

Bredenberg, C., Simoncelli, E. P. & Savin, C. Learning efficient task-dependent representations with synaptic plasticity. In Advances in Neural Information Processing Systems 33 (eds Larochelle, H. et al.) 15714–15724 (Neural Information Processing Systems Foundation, 2020).

Savin, C. & Triesch, J. Emergence of task-dependent representations in working memory circuits. Front. Comput. Neurosci. 8, 57 (2014).

Article PubMed PubMed Central Google Scholar

Gerstner, W., Lehmann, M., Liakoni, V., Corneil, D. & Brea, J. Eligibility traces and plasticity on behavioral time scales: experimental support of neoHebbian three-factor learning rules. Front. Neural Circuits 12, 53 (2018).

Article PubMed PubMed Central Google Scholar

Frémaux, N. & Gerstner, W. Neuromodulated spike-timing-dependent plasticity, and theory of three-factor learning rules. Front. Neural Circuits 9, 85 (2016).

Article PubMed PubMed Central Google Scholar

Wei, X.-X. & Stocker, A. A. A Bayesian observer model constrained by efficient coding can explain ‘anti-Bayesian’ percepts. Nat. Neu

View original article

NATURE NEUROSCIENCE

Like

分享书签

0 0 0 0 0 0 0

More from this channel

Reward prediction error neurons implement an efficient code for reward

留言 (0)