Reward prediction error neurons implement an efficient code for reward

Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).

Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction (MIT Press, 2018).

Balleine, B. W., Daw, N. D. & O’Doherty, J. P. in Neuroeconomics (eds Glimcher, P. W. et al.) 367–387 (Academic Press, 2009).

Attneave, F. Some informational aspects of visual perception. Psychol. Rev. 61, 183–193 (1954).

Barlow, H. B. in Sensory Communication (ed Rosenblith, W. A.) 216–234 (MIT Press, 1961).

Laughlin, S. A simple coding procedure enhances a neuron’s information capacity. Z. Naturforsch. C Biosci. 36, 910–912 (1981).

Schwartz, O. & Simoncelli, E. P. Natural signal statistics and sensory gain control. Nat. Neurosci. 4, 819–825 (2001).

Wei, X.-X. & Stocker, A. A. Lawful relation between perceptual bias and discriminability. Proc. Natl Acad. Sci. USA 114, 10244–10249 (2017).

Louie, K., Glimcher, P. W. & Webb, R. Adaptive neural coding: from biological to behavioral decision-making. Curr. Opin. Behav. Sci. 5, 91–99 (2015).

Polanía, R., Woodford, M. & Ruff, C. C. Efficient coding of subjective value. Nat. Neurosci. 22, 134–142 (2019).

Bhui, R., Lai, L. & Gershman, S. J. Resource-rational decision making. Curr. Opin. Behav. Sci. 41, 15–21 (2021).

Louie, K. & Glimcher, P. W. Efficient coding and the neural representation of value. Ann. N Y Acad. Sci. 1251, 13–32 (2012).

Motiwala, A., Soares, S., Atallah, B. V., Paton, J. J. & Machens, C. K. Efficient coding of cognitive variables underlies dopamine response and choice behavior. Nat. Neurosci. 25, 738–748 (2022).

Eshel, N. et al. Arithmetic and local circuitry underlying dopamine prediction errors. Nature 525, 243–246 (2015).

Eshel, N., Tian, J., Bukwich, M. & Uchida, N. Dopamine neurons share common response function for reward prediction error. Nat. Neurosci. 19, 479–486 (2016).

Dabney, W. et al. A distributional code for value in dopamine-based reinforcement learning. Nature 577, 671–675 (2020).

Rothenhoefer, K. M., Hong, T., Alikaya, A. & Stauffer, W. R. Rare rewards amplify dopamine responses. Nat. Neurosci. 24, 465–469 (2021).

Ganguli, D. & Simoncelli, E. P. Efficient sensory encoding and Bayesian inference with heterogeneous neural populations. Neural Comput. 26, 2103–2134 (2014).

Fiorillo, C. D., Tobler, P. N. & Schultz, W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299, 1898–1902 (2003).

Cohen, J. D. & Servan-Schreiber, D. A theory of dopamine function and its role in cognitive deficits in schizophrenia. Schizophr. Bull. 19, 85–104 (1993).

Wei, X.-X. & Stocker, A. A. Bayesian inference with efficient neural population codes. In Artificial Neural Networks and Machine Learning—ICANN 2012, Vol. 7552 (eds Hutchison, D. et al.) 523–530 (Springer, 2012).

Frank, M. J., Seeberger, L. C. & O’Reilly, R. C. By carrot or by stick: cognitive reinforcement learning in Parkinsonism. Science 306, 1940–1943 (2004).

Mikhael, J. G. & Bogacz, R. Learning reward uncertainty in the basal ganglia. PLoS Comput. Biol. 12, e1005062 (2016).

Kobayashi, S. & Schultz, W. Influence of reward delays on responses of dopamine neurons. J. Neurosci. 28, 7837–7846 (2008).

Roesch, M. R., Calu, D. J. & Schoenbaum, G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat. Neurosci. 10, 1615–1624 (2007).

Kim, H. R. et al. A unified framework for dopamine signals across timescales. Cell 183, 1600–1616 (2020).

Starkweather, C. K. & Uchida, N. Dopamine signals as temporal difference errors: recent advances. Curr. Opin. Neurobiol. 67, 95–105 (2021).

Starkweather, C. K., Babayan, B. M., Uchida, N. & Gershman, S. J. Dopamine reward prediction errors reflect hidden-state inference across time. Nat. Neurosci. 20, 581–589 (2017).

Soares, S., Atallah, B. V. & Paton, J. J. Midbrain dopamine neurons control judgment of time. Science 354, 1273–1277 (2016).

Tano, P., Dayan, P. & Pouget, A. A local temporal difference code for distributional reinforcement learning. In Advances in Neural Information Processing Systems 33 (eds Larochelle, H. et al.) 13662–13673 (Neural Information Processing Systems Foundation, 2020).

Louie, K. Asymmetric and adaptive reward coding via normalized reinforcement learning. PLoS Comput. Biol. 18, e1010350 (2022).

Naka, K. I. & Rushton, W. A. H. An attempt to analyse colour reception by electrophysiology. J. Physiol. 185, 556–586 (1966).

Bredenberg, C., Simoncelli, E. P. & Savin, C. Learning efficient task-dependent representations with synaptic plasticity. In Advances in Neural Information Processing Systems 33 (eds Larochelle, H. et al.) 15714–15724 (Neural Information Processing Systems Foundation, 2020).

Savin, C. & Triesch, J. Emergence of task-dependent representations in working memory circuits. Front. Comput. Neurosci. 8, 57 (2014).

Gerstner, W., Lehmann, M., Liakoni, V., Corneil, D. & Brea, J. Eligibility traces and plasticity on behavioral time scales: experimental support of neoHebbian three-factor learning rules. Front. Neural Circuits 12, 53 (2018).

Frémaux, N. & Gerstner, W. Neuromodulated spike-timing-dependent plasticity, and theory of three-factor learning rules. Front. Neural Circuits 9, 85 (2016).

Wei, X.-X. & Stocker, A. A. A Bayesian observer model constrained by efficient coding can explain ‘anti-Bayesian’ percepts. Nat. Neu
