Disentangling Exploration from Exploitation

Author

Alessandro Lizzeri
Eran Shmaya
Leeat Yariv

Abstract

Starting from Robbins (1952), the literature on experimentation via multi-armed bandits has wed exploration and exploitation. Nonetheless, in many applications, agents' exploration and exploitation need not be intertwined: a policymaker may assess new policies different than the status quo; an investor may evaluate projects outside her portfolio. We characterize the optimal experimentation policy when exploration and exploitation are disentangled in the case of Poisson bandits, allowing for general news structures. The optimal policy features complete learning asymptotically, exhibits lots of persistence, but cannot be identified by an index à la Gittins. Disentanglement is particularly valuable for intermediate parameter values.

Suggested Citation

Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," NBER Working Papers 32424, National Bureau of Economic Research, Inc.

Handle: RePEc:nbr:nberwo:32424
Note: EH IO LE LS ME POL

Other versions of this item:

Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Papers 2404.19116, arXiv.org.
Lizzeri, Alessandro & Shmaya, Eran & Yariv, Leeat, 2024. "Disentangling Exploration from Exploitation," CEPR Discussion Papers 19058, C.E.P.R. Discussion Papers.
Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Working Papers 334, Princeton University, Department of Economics, Center for Economic Policy Studies.

References listed on IDEAS

Janet M. Currie & W. Bentley MacLeod, 2020. "Understanding Doctor Decision Making: The Case of Depression Treatment," Econometrica, Econometric Society, vol. 88(3), pages 847-878, May.
Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
Miller, Robert A, 1984. "Job Matching and Occupational Choice," Journal of Political Economy, University of Chicago Press, vol. 92(6), pages 1086-1120, December.
Yeon-Koo Che & Konrad Mierendorff, 2019. "Optimal Dynamic Allocation of Attention," American Economic Review, American Economic Association, vol. 109(8), pages 2993-3029, August.
- Yeon-Koo Che & Konrad Mierendorff, 2018. "Optimal Dynamic Allocation of Attention," Papers 1812.06967, arXiv.org.
Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R, Cowles Foundation for Research in Economics, Yale University, revised Feb 2012.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R3, Cowles Foundation for Research in Economics, Yale University, revised Jun 2013.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726, Cowles Foundation for Research in Economics, Yale University.
- Johannes Horner & Larry Samuelson, 2012. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000418, David K. Levine.
- Johannes Horner & Larry Samuelson, 2013. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000671, David K. Levine.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R2, Cowles Foundation for Research in Economics, Yale University, revised Mar 2013.
Keller, Godfrey & Rady, Sven, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
- Sven Rady & Godfrey Keller, 2007. "Strategic Experimentation with Poisson Bandits," 2007 Meeting Papers 332, Society for Economic Dynamics.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 260, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Papers in Economics 10575, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2009. "Strategic Experimentation with Poisson Bandits," CEPR Discussion Papers 7270, C.E.P.R. Discussion Papers.
Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
- Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Godfrey Keller & Martin Cripps & Sven Rady, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.
Ettore Damiano & Hao Li & Wing Suen, 2020. "Learning While Experimenting," The Economic Journal, Royal Economic Society, vol. 130(625), pages 65-92.
Jovanovic, Boyan, 1979. "Job Matching and the Theory of Turnover," Journal of Political Economy, University of Chicago Press, vol. 87(5), pages 972-990, October.
- Thomas Sargent, "undated". "Matlab code for Jovanovic's matching model," QM&RBC Codes 24, Quantitative Macroeconomics & Real Business Cycles.
Annie Liang & Xiaosheng Mu & Vasilis Syrgkanis, 2022. "Dynamically Aggregating Diverse Information," Econometrica, Econometric Society, vol. 90(1), pages 47-80, January.
Sims, Christopher A., 2003. "Implications of rational inattention," Journal of Monetary Economics, Elsevier, vol. 50(3), pages 665-690, April.
Bruno Strulovici, 2010. "Learning While Voting: Determinants of Collective Experimentation," Econometrica, Econometric Society, vol. 78(3), pages 933-971, May.
- Bruno Strulovici, 2008. "Learning while voting: determinants of collective experimentation," Economics Papers 2008-W08, Economics Group, Nuffield College, University of Oxford.
Bartosz Maćkowiak & Filip Matějka & Mirko Wiederholt, 2023. "Rational Inattention: A Review," Journal of Economic Literature, American Economic Association, vol. 61(1), pages 226-273, March.
- Mackowiak, Bartosz & Matějka, Filip & Wiederholt, Mirko, 2020. "Rational Inattention: A Review," CEPR Discussion Papers 15408, C.E.P.R. Discussion Papers.
- Bartosz Maćkowiak & Filip Matějka & Mirko Wiederholt, 2023. "Rational Inattention: A Review," SciencePo Working papers Main hal-03878692, HAL.
- Maćkowiak, Bartosz & Matějka, Filip & Wiederholt, Mirko, 2021. "Rational inattention: a review," Working Paper Series 2570, European Central Bank.
- Bartosz Maćkowiak & Filip Matějka & Mirko Wiederholt, 2023. "Rational Inattention: A Review," Post-Print hal-03878692, HAL.
Yingni Guo, 2016. "Dynamic Delegation of Experimentation," American Economic Review, American Economic Association, vol. 106(8), pages 1969-2008, August.
Janet M. Currie & W. Bentley MacLeod, 2018. "Understanding Doctor Decision Making: The Case of Depression," NBER Working Papers 24955, National Bureau of Economic Research, Inc.
- Janet M. Currie & W. Bentley MacLeod, 2020. "Understanding Doctor Decision Making: The Case of Depression," Working Papers 2020-77, Princeton University. Economics Department.
Yeon-Koo Che & Johannes Hörner, 2018. "Recommender Systems as Mechanisms for Social Learning," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 133(2), pages 871-925.
Gilles Stoltz & Sébastien Bubeck & Rémi Munos, 2011. "Pure exploration in finitely-armed and continuous-armed bandits," Post-Print hal-00609550, HAL.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Bergemann, Dirk & Hege, Ulrich, 1998. "Venture capital financing, moral hazard, and learning," Journal of Banking & Finance, Elsevier, vol. 22(6-8), pages 703-735, August.
- Bergemann, Dirk & Hege, Ulrich, 1997. "Venture Capital Financing, Moral Hazard and Learning," CEPR Discussion Papers 1738, C.E.P.R. Discussion Papers.
- Ulrich Hege & Dirk Bergemann, 1998. "Venture capital financing, moral hazard, and learning," Post-Print hal-00481696, HAL.
- Bergemann, D. & Hege, U., 1997. "Venture Capital Financing, Moral Hazard and Learning," Other publications TiSEM d70119dd-1d85-4dde-9d59-1, Tilburg University, School of Economics and Management.
Annie Liang & Xiaosheng Mu, 2020. "Complementary Information and Learning Traps," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 135(1), pages 389-448.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
- Chia-Hui Chen & Junichiro Ishida, 2015. "Hierarchical Experimentation," ISER Discussion Paper 0949, Institute of Social and Economic Research, Osaka University.
Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
- Heidhues, Paul & Rady, Sven & Strack, Philipp, 2012. "Strategic Experimentation with Private Payoffs," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 387, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Rady, Sven & Heidhues, Paul & Strack, Philipp, 2015. "Strategic Experimentation with Private Payoffs," CEPR Discussion Papers 10634, C.E.P.R. Discussion Papers.
Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
- Jean Guillaume Forand, 2010. "Keeping Your Options Open," RCER Working Papers 557, University of Rochester - Center for Economic Research (RCER).
- Jean Guillaume Forand, 2011. "Keeping Your Options Open," 2011 Meeting Papers 82, Society for Economic Dynamics.
- Jean Guillaume Forand, 2013. "Keeping Your options Open," Working Papers 1301, University of Waterloo, Department of Economics, revised Feb 2015.
Mira Frick & Yuhta Ishii, 2015. "Innovation Adoption by Forward-Looking Social Learners," Cowles Foundation Discussion Papers 1877, Cowles Foundation for Research in Economics, Yale University.
Jan Eeckhout & Xi Weng, 2022. "Assortative Learning," Economica, London School of Economics and Political Science, vol. 89(355), pages 647-688, July.
- Xi Weng & Jan Eeckhout, 2010. "Assortative Learning," 2010 Meeting Papers 356, Society for Economic Dynamics.
Thomas, Caroline, 2019. "Experimentation with reputation concerns – Dynamic signalling with changing types," Journal of Economic Theory, Elsevier, vol. 179(C), pages 366-415.
Yingkai Li & Jonathan Libgober, 2023. "Implementing Evidence Acquisition: Time Dependence in Contracts for Advice," Papers 2310.19147, arXiv.org, revised Sep 2024.
Aubrey Clark & Giovanni Reggiani, 2021. "Contracts for acquiring information," Papers 2103.03911, arXiv.org.
Thomas Greve & Hans Keiding, 2023. "A model of privately funded public research," Journal of Economics, Springer, vol. 140(1), pages 63-91, September.
Weng, Xi, 2015. "Dynamic pricing in the presence of individual learning," Journal of Economic Theory, Elsevier, vol. 155(C), pages 262-299.
Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
Chen, Chia-Hui & Ishida, Junichiro & Mukherjee, Arijit, 2023. "Pioneer, early follower or late entrant: Entry dynamics with learning and market competition," European Economic Review, Elsevier, vol. 152(C).
- Chia-Hui Chen & Junichiro Ishida & Arijit Mukherjee, 2021. "Pioneer, Early Follower or Late Entrant: Entry Dynamics with Learning and Market Competition," ISER Discussion Paper 1132, Institute of Social and Economic Research, Osaka University.
Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R, Cowles Foundation for Research in Economics, Yale University, revised Feb 2012.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R3, Cowles Foundation for Research in Economics, Yale University, revised Jun 2013.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726, Cowles Foundation for Research in Economics, Yale University.
- Johannes Horner & Larry Samuelson, 2012. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000418, David K. Levine.
- Johannes Horner & Larry Samuelson, 2013. "Incentives for Experimenting Agents," Levine's Working Paper Archive 786969000000000671, David K. Levine.
- Johannes Horner & Larry Samuelson, 2009. "Incentives for Experimenting Agents," Cowles Foundation Discussion Papers 1726R2, Cowles Foundation for Research in Economics, Yale University, revised Mar 2013.
Besanko, David & Tong, Jian & Wu, Jianjun, 2016. "Subsidizing research programs with "if" and "when" uncertainty in the face of severe informational constraints," Discussion Paper Series In Economics And Econometrics 1605, Economics Division, School of Social Sciences, University of Southampton.
Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
- Fahad Khalil & Jacques Lawarree & Alexander Rodivilov, 2018. "Learning from Failures: Optimal Contract for Experimentation and Production," CESifo Working Paper Series 7310, CESifo.
Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
- Yingyao Hu & Yutaka Kayaba & Matthew Shum, 2010. "Nonparametric learning rules from bandit experiments: the eyes have it!," CeMMAP working papers CWP15/10, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Yingyao Hu & Yutaka Kayaba & Matt Shum, 2010. "Nonparametric Learning Rules from Bandit Experiments: The Eyes have it!," Economics Working Paper Archive 560, The Johns Hopkins University,Department of Economics.
Keller, Godfrey & Rady, Sven, 2020. "Undiscounted bandit games," Games and Economic Behavior, Elsevier, vol. 124(C), pages 43-61.
- Keller, Godfrey & Rady, Sven, 2015. "Undiscounted Bandit Games," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 520, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Economics Series Working Papers 882, University of Oxford, Department of Economics.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2019_130, University of Bonn and University of Mannheim, Germany.
- Rady, Sven & Keller, R Godfrey, 2019. "Undiscounted Bandit Games," CEPR Discussion Papers 14046, C.E.P.R. Discussion Papers.
- Godfrey Keller & Sven Rady, 2020. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2020_130v2, University of Bonn and University of Mannheim, Germany.
- Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Papers 1909.13323, arXiv.org, revised Aug 2020.
Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
Caroline D. Thomas, 2021. "Strategic Experimentation with Congestion," American Economic Journal: Microeconomics, American Economic Association, vol. 13(1), pages 1-82, February.
- Caroline D. Thomas, 2010. "Strategic Experimentation with Congestion," Department of Economics Working Papers 130813, The University of Texas at Austin, Department of Economics, revised Aug 2013.
- Caroline D Thomas, 2010. "Strategic Experimentation with Congestion," Department of Economics Working Papers 130907, The University of Texas at Austin, Department of Economics, revised 04 Nov 2014.
Emeric Henry & Marco Loseto & Marco Ottaviani, 2022. "Regulation with Experimentation: Ex Ante Approval, Ex Post Withdrawal, and Liability," Management Science, INFORMS, vol. 68(7), pages 5330-5347, July.
- Ottaviani, Marco & Loseto, Marco, 2018. "Regulation with Experimentation: Ex Ante Approval, Ex Post Withdrawal, and Liability," CEPR Discussion Papers 13224, C.E.P.R. Discussion Papers.
- Emeric Henry & Marco Loseto & Marco Ottaviani, 2022. "Regulation with Experimentation: Ex Ante Approval, Ex Post Withdrawal, and Liability," Post-Print hal-03874153, HAL.
- Emeric Henry & Marco Loseto & Marco Ottaviani, 2022. "Regulation with Experimentation: Ex Ante Approval, Ex Post Withdrawal, and Liability," SciencePo Working papers Main hal-03874153, HAL.

More about this item

JEL classification:

C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
D81 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Criteria for Decision-Making under Risk and Uncertainty
D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
O35 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Social Innovation

NEP fields

This paper has been announced in the following NEP Reports:

NEP-MIC-2024-06-24 (Microeconomics)
NEP-PPM-2024-06-24 (Project, Program and Portfolio Management)
