My bibliography Save this paper

Neural and computational underpinnings of biased confidence in human reinforcement learning

Author

Listed:

Chih-Chung Ting
(UHH - Universität Hamburg)
Nahuel Salem-Garcia
(CISA - Swiss Center for Affective Sciences - UNIGE - Université de Genève = University of Geneva)
Stefano Palminteri
(ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres)
Jan Engelmann
(ASE - Amsterdam School of Economics - UvA - University of Amsterdam [Amsterdam] = Universiteit van Amsterdam)
Maël Lebreton
(CISA - Swiss Center for Affective Sciences - UNIGE - Université de Genève = University of Geneva, PSE - Paris School of Economics - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - ENPC - École des Ponts ParisTech - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, PJSE - Paris Jourdan Sciences Economiques - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - ENPC - École des Ponts ParisTech - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

Abstract

While navigating a fundamentally uncertain world, humans and animals constantly evaluate the probability of their decisions, actions or statements being correct. When explicitly elicited, these confidence estimates typically correlates positively with neural activity in a ventromedial-prefrontal (VMPFC) network and negatively in a dorsolateral and dorsomedial prefrontal network. Here, combining fMRI with a reinforcement-learning paradigm, we leverage the fact that humans are more confident in their choices when seeking gains than avoiding losses to reveal a functional dissociation: whereas the dorsal prefrontal network correlates negatively with a condition-specific confidence signal, the VMPFC network positively encodes task-wide confidence signal incorporating the valence-induced bias. Challenging dominant neuro-computational models, we found that decision-related VMPFC activity better correlates with confidence than with option-values inferred from reinforcement-learning models. Altogether, these results identify the VMPFC as a key node in the neuro-computational architecture that builds global feeling-of-confidence signals from latent decision variables and contextual biases during reinforcement-learning.

Suggested Citation

Chih-Chung Ting & Nahuel Salem-Garcia & Stefano Palminteri & Jan Engelmann & Maël Lebreton, 2023. "Neural and computational underpinnings of biased confidence in human reinforcement learning," PSE-Ecole d'économie de Paris (Postprint) halshs-04409145, HAL.

Handle: RePEc:hal:pseptp:halshs-04409145
DOI: 10.1038/s41467-023-42589-5

Download full text from publisher

To our knowledge, this item is not available for download. To find whether it is available, there are three options:
1. Check below whether another version of this item is available online.
2. Check on the provider's web page whether it is in fact available.
3. Perform a search for a similarly titled item that would be available.

Other versions of this item:

Chih-Chung Ting & Nahuel Salem-Garcia & Stefano Palminteri & Jan B. Engelmann & Maël Lebreton, 2023. "Neural and computational underpinnings of biased confidence in human reinforcement learning," Nature Communications, Nature, vol. 14(1), pages 1-18, December.

Chih-Chung Ting & Nahuel Salem-Garcia & Stefano Palminteri & Jan Engelmann & Maël Lebreton, 2023. "Neural and computational underpinnings of biased confidence in human reinforcement learning," Post-Print halshs-04409145, HAL.

References listed on IDEAS

Stefano Palminteri & Mehdi Khamassi & Mateus Joffily & Giorgio Coricelli, 2015. "Contextual modulation of value signals in reward and punishment learning," Nature Communications, Nature, vol. 6(1), pages 1-14, November.
Maël Lebreton & Sophie Bavard & Jean Daunizeau & Stefano Palminteri, 2019. "Assessing inter-individual differences with task-related functional neuroimaging," Nature Human Behaviour, Nature, vol. 3(9), pages 897-905, September.
Jean Daunizeau & Vincent Adam & Lionel Rigoux, 2014. "VBA: A Probabilistic Treatment of Nonlinear Models for Neurobiological and Behavioural Data," PLOS Computational Biology, Public Library of Science, vol. 10(1), pages 1-16, January.
Marine Hainguerlot & Jean-Christophe Vergnaud & Vincent de Gardelle, 2018. "Metacognitive ability predicts learning cue-stimulus associations in the absence of external feedback," PSE-Ecole d'économie de Paris (Postprint) hal-01761531, HAL.
- Marine Hainguerlot & Jean-Christophe Vergnaud & Vincent de Gardelle, 2018. "Metacognitive ability predicts learning cue-stimulus associations in the absence of external feedback," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01761531, HAL.
- Marine Hainguerlot & Jean-Christophe Vergnaud & Vincent de Gardelle, 2018. "Metacognitive ability predicts learning cue-stimulus associations in the absence of external feedback," Post-Print hal-01761531, HAL.
Sophie Bavard & Maël Lebreton & Mehdi Khamassi & Giorgio Coricelli & Stefano Palminteri, 2018. "Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences," Nature Communications, Nature, vol. 9(1), pages 1-12, December.
Guillaume Hollard & Sébastien Massoni & Jean-Christophe Vergnaud, 2016. "In search of good probability assessors: an experimental comparison of elicitation rules for confidence judgments," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01306258, HAL.
- Guillaume Hollard & Sébastien Massoni & Jean-Christophe Vergnaud, 2016. "In search of good probability assessors: an experimental comparison of elicitation rules for confidence judgments," Post-Print hal-01306258, HAL.
Karl Schlag & James Tremewan & Joël Weele, 2015. "A penny for your thoughts: a survey of methods for eliciting beliefs," Experimental Economics, Springer;Economic Science Association, vol. 18(3), pages 457-490, September.
- Karl Schlag & James Tremewan & Joel von der Weele, 2014. "A Penny for your Thoughts: A Survey of Methods of Eliciting Beliefs," Vienna Economics Papers vie1401, University of Vienna, Department of Economics.
Nadescha Trudel & Jacqueline Scholl & Miriam C. Klein-Flügge & Elsa Fouragnan & Lev Tankelevitch & Marco K. Wittmann & Matthew F. S. Rushworth, 2021. "Polarity of uncertainty representation during exploration and exploitation in ventromedial prefrontal cortex," Nature Human Behaviour, Nature, vol. 5(1), pages 83-98, January.
Guillaume Hollard & Sébastien Massoni & Jean-Christophe Vergnaud, 2016. "In search of good probability assessors: an experimental comparison of elicitation rules for confidence judgments," Theory and Decision, Springer, vol. 80(3), pages 363-387, March.
- Guillaume Hollard & Sébastien Massoni & Jean-Christophe Vergnaud, 2016. "In search of good probability assessors: an experimental comparison of elicitation rules for confidence judgments," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01306258, HAL.
- Guillaume Hollard & Sébastien Massoni & Jean-Christophe Vergnaud, 2016. "In search of good probability assessors: an experimental comparison of elicitation rules for confidence judgments," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01387553, HAL.
Germain Lefebvre & Maël Lebreton & Florent Meyniel & Sacha Bourgeois-Gironde & Stefano Palminteri, 2017. "Behavioural and neural characterization of optimistic reinforcement learning," Nature Human Behaviour, Nature, vol. 1(4), pages 1-9, April.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Maël Lebreton & Karin Bacily & Stefano Palminteri & Jan B Engelmann, 2019. "Contextual influence on confidence judgments in human reinforcement learning," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-27, April.
Jean-Pierre Benoît & Juan Dubra & Giorgia Romagnoli, 2022. "Belief Elicitation When More than Money Matters: Controlling for "Control"," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 837-888, August.
- Juan Dubra & Jean-Pierre Benoit & Giorgia Romagnoli, 2020. "Belief Elicitation When More Than Money Matters:Controlling for "Control"," Documentos de Trabajo/Working Papers 2001, Facultad de Ciencias Empresariales y Economia. Universidad de Montevideo..
Rafkin, Charlie & Shreekumar, Advik & Vautrey, Pierre-Luc, 2021. "When guidance changes: Government stances and public beliefs," Journal of Public Economics, Elsevier, vol. 196(C).
Folli, Dominik & Wolff, Irenaeus, 2022. "Biases in belief reports," Journal of Economic Psychology, Elsevier, vol. 88(C).
- Bauer, Dominik & Wolff, Irenaeus, 2021. "Biases in Belief Reports," VfS Annual Conference 2021 (Virtual Conference): Climate Economics 242458, Verein für Socialpolitik / German Economic Association.
Stefano Palminteri & Germain Lefebvre & Emma J Kilford & Sarah-Jayne Blakemore, 2017. "Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing," PLOS Computational Biology, Public Library of Science, vol. 13(8), pages 1-22, August.
Jan B. Engelmann & Maël Lebreton & Nahuel A. Salem-Garcia & Peter Schwardmann & Joël J. van der Weele, 2024. "Anticipatory Anxiety and Wishful Thinking," American Economic Review, American Economic Association, vol. 114(4), pages 926-960, April.
- Jan Engelmann & Maël Lebreton & Peter Schwardmann & Joël van der Weele & Li-Ang Chang, 2019. "Anticipatory Anxiety and Wishful Thinking," Tinbergen Institute Discussion Papers 19-042/I, Tinbergen Institute.
- Engelmann, Jan & LeBreton, MaÃ«l & Salem-Garcia, Nahuel & Schwardmann, Peter & van der Weele, JoÃ«l, 2022. "Anticipatory Anxiety and Wishful Thinking," CEPR Discussion Papers 17665, C.E.P.R. Discussion Papers.
Johann Lussange & Stefano Vrizzi & Stefano Palminteri & Boris Gutkin, 2024. "Modelling crypto markets by multi-agent reinforcement learning," Papers 2402.10803, arXiv.org.
Juan Dubra & Jean-Pierre Benoît & Giorgia Romagnoli, 2019. "Belief elicitation when more than money matters," Documentos de Trabajo/Working Papers 1901, Facultad de Ciencias Empresariales y Economia. Universidad de Montevideo..
- Benoît, Jean-Pierre & Dubra, Juan & Romagnoli, Giorgia, 2019. "Belief elicitation when more than money matters," MPRA Paper 95550, University Library of Munich, Germany.
Dominik Bauer & Irenaeus Wolff, 2018. "Biases in Beliefs: Experimental Evidence," TWI Research Paper Series 109, Thurgauer Wirtschaftsinstitut, UniversitÃ¤t Konstanz.
Bauer, Dominik & Wolff, Irenaeus, 2019. "Biases in Beliefs," VfS Annual Conference 2019 (Leipzig): 30 Years after the Fall of the Berlin Wall - Democracy and Market Economy 203601, Verein für Socialpolitik / German Economic Association.
Gneezy, Uri & Saccardo, Silvia & Serra-Garcia, Marta & van Veldhuizen, Roel, 2020. "Bribing the Self," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 120, pages 311-324.
- Gneezy, Uri & Saccardo, Silvia & Serra-Garcia, Marta & van Veldhuizen, Roel, 2020. "Bribing the Self," Games and Economic Behavior, Elsevier, vol. 120(C), pages 311-324.
- Uri Gneezy & Silvia Saccardo & Marta Serra-Garcia & Roel van Veldhuizen, 2020. "Bribing the Self," CESifo Working Paper Series 8065, CESifo.
Thomas Garcia & Sébastien Massoni, 2017. "Aiming to choose correctly or to choose wisely? The optimality-accuracy trade-off in decisions under uncertainty," Working Papers halshs-01631540, HAL.
- Thomas Garcia & Sébastien Massoni, 2017. "Aiming to choose correctly or to choose wisely ? The optimality-accuracy trade-off in decisions under uncertainty," Working Papers 1714, Groupe d'Analyse et de Théorie Economique Lyon St-Étienne (GATE Lyon St-Étienne), Université de Lyon.
Lefebvre, Germain & Nioche, Aurélien & Bourgeois-Gironde, Sacha & Palminteri, Stefano, 2018. "An Empirical Investigation of the Emergence of Money: Contrasting Temporal Difference and Opportunity Cost Reinforcement Learning," MPRA Paper 85586, University Library of Munich, Germany.
Johann Lussange & Boris Gutkin, 2023. "Order book regulatory impact on stock market quality: a multi-agent reinforcement learning perspective," Papers 2302.04184, arXiv.org.
Fezzi, Carlo & Menapace, Luisa & Raffaelli, Roberta, 2021. "Estimating risk preferences integrating insurance choices with subjective beliefs," European Economic Review, Elsevier, vol. 135(C).
Markus M. Möbius & Muriel Niederle & Paul Niehaus & Tanya S. Rosenblat, 2022. "Managing Self-Confidence: Theory and Experimental Evidence," Management Science, INFORMS, vol. 68(11), pages 7793-7817, November.
- Markus M. Mobius & Muriel Niederle & Paul Niehaus & Tanya S. Rosenblat, 2011. "Managing Self-Confidence: Theory and Experimental Evidence," NBER Working Papers 17014, National Bureau of Economic Research, Inc.
- Markus M. Mobius & Muriel Niederle & Paul Niehaus & Tanya Rosenblat, 2011. "Managing self-confidence: theory and experimental evidence," Working Papers 11-14, Federal Reserve Bank of Boston.
Dmitri Vinogradov & Yousef Makhlouf, 2021. "Signaling probabilities in ambiguity: who reacts to vague news?," Theory and Decision, Springer, vol. 90(3), pages 371-404, May.
Zahra Murad & Charitini Stavropoulou & Graham Cookson, 2019. "Incentives and gender in a multi-task setting: An experimental study with real-effort tasks," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-18, March.
- Zahra Murad & Charitini Stavropoulou & Graham Cookson, 2018. "Incentives and Gender in a Multitask Setting: an Experimental Study with Real-Effort Tasks," Working Papers in Economics & Finance 2018-07, University of Portsmouth, Portsmouth Business School, Economics and Finance Subject Group.
Bucciol, Alessandro & Quercia, Simone & Sconti, Alessia, 2021. "Promoting financial literacy among the elderly: Consequences on confidence," Journal of Economic Psychology, Elsevier, vol. 87(C).
- Alessandro Bucciol & Simone Quercia & Alessia Sconti, 2020. "Promoting Financial Literacy among the Elderly: Consequences on Confidence," Working Papers 12/2020, University of Verona, Department of Economics.
Murad, Zahra & Starmer, Chris, 2021. "Confidence snowballing and relative performance feedback," Journal of Economic Behavior & Organization, Elsevier, vol. 190(C), pages 550-572.
- Zahra Murad & Chris Starmer, 2020. "Confidence Snowballing and Relative Performance Feedback," Discussion Papers 2020-08, The Centre for Decision Research and Experimental Economics, School of Economics, University of Nottingham.
- Zahra Murad & Chris Starmer, 2020. "Confidence Snowballing and Relative Performance Feedback," Working Papers in Economics & Finance 2020-08, University of Portsmouth, Portsmouth Business School, Economics and Finance Subject Group.

More about this item

Keywords

Decision; Decision making; Human behaviour; Learning algorithms;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:pseptp:halshs-04409145. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Caroline Bauer (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Neural and computational underpinnings of biased confidence in human reinforcement learning

Author

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data