default search action
Edgar A. Duéñez-Guzmán
Person information
- affiliation: DeepMind, London, UK
- affiliation (PhD 2009): University of Tennessee, Knoxville, TN, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c12]Ian Gemp, Marc Lanctot, Luke Marris, Yiran Mao, Edgar A. Duéñez-Guzmán, Sarah Perrin, Andras Gyorgy, Romuald Elie, Georgios Piliouras, Michael Kaisers, Daniel Hennes, Kalesha Bullard, Kate Larson, Yoram Bachrach:
Approximating the Core via Iterative Coalition Sampling. AAMAS 2024: 669-678 - [i21]Ian Gemp, Marc Lanctot, Luke Marris, Yiran Mao, Edgar A. Duéñez-Guzmán, Sarah Perrin, Andras Gyorgy, Romuald Elie, Georgios Piliouras, Michael Kaisers, Daniel Hennes, Kalesha Bullard, Kate Larson, Yoram Bachrach:
Approximating the Core via Iterative Coalition Sampling. CoRR abs/2402.03928 (2024) - [i20]Edgar A. Duéñez-Guzmán, Suzanne Sadedin, Jane X. Wang, Kevin R. McKee, Joel Z. Leibo:
A social path to human-like artificial intelligence. CoRR abs/2405.15815 (2024) - 2023
- [j6]Madeline G. Reinecke, Yiran Mao, Markus Kunesch, Edgar A. Duéñez-Guzmán, Julia Haas, Joel Z. Leibo:
The Puzzle of Evaluating Moral Cognition in Artificial Agents. Cogn. Sci. 47(8) (2023) - [j5]Edgar A. Duéñez-Guzmán, Suzanne Sadedin, Jane X. Wang, Kevin R. McKee, Joel Z. Leibo:
A social path to human-like artificial intelligence. Nat. Mac. Intell. 5(11): 1181-1188 (2023) - [c11]Peter Sunehag, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Igor Mordatch, Joel Z. Leibo:
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition. AAMAS 2023: 2827-2829 - [i19]Peter Sunehag, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Igor Mordatch, Joel Z. Leibo:
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition. CoRR abs/2302.01180 (2023) - [i18]Udari Madhushani, Kevin R. McKee, John P. Agapiou, Joel Z. Leibo, Richard Everett, Thomas W. Anthony, Edward Hughes, Karl Tuyls, Edgar A. Duéñez-Guzmán:
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas. CoRR abs/2305.00768 (2023) - [i17]Yiran Mao, Madeline G. Reinecke, Markus Kunesch, Edgar A. Duéñez-Guzmán, Ramona Comanescu, Julia Haas, Joel Z. Leibo:
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity. CoRR abs/2305.18269 (2023) - [i16]Alexander Sasha Vezhnevets, John P. Agapiou, Avia Aharon, Ron Ziv, Jayd Matyas, Edgar A. Duéñez-Guzmán, William A. Cunningham, Simon Osindero, Danny Karmon, Joel Z. Leibo:
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia. CoRR abs/2312.03664 (2023) - 2022
- [j4]Ian Gemp, Thomas W. Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome T. Connor, Vibhavari Dasagi, Bart De Vylder, Edgar A. Duéñez-Guzmán, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Pérolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls:
Developing, evaluating and scaling learning agents in multi-agent environments. AI Commun. 35(4): 271-284 (2022) - [c10]Ian Gemp, Kevin R. McKee, Richard Everett, Edgar A. Duéñez-Guzmán, Yoram Bachrach, David Balduzzi, Andrea Tacchetti:
D3C: Reducing the Price of Anarchy in Multi-Agent Learning. AAMAS 2022: 498-506 - [i15]Kavya Kopparapu, Edgar A. Duéñez-Guzmán, Jayd Matyas, Alexander Sasha Vezhnevets, John P. Agapiou, Kevin R. McKee, Richard Everett, Janusz Marecki, Joel Z. Leibo, Thore Graepel:
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria. CoRR abs/2201.01816 (2022) - [i14]Ian Gemp, Thomas W. Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome T. Connor, Vibhavari Dasagi, Bart De Vylder, Edgar A. Duéñez-Guzmán, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Pérolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls:
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments. CoRR abs/2209.10958 (2022) - [i13]John P. Agapiou, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Jayd Matyas, Yiran Mao, Peter Sunehag, Raphael Köster, Udari Madhushani, Kavya Kopparapu, Ramona Comanescu, DJ Strouse, Michael Bradley Johanson, Sukhdeep Singh, Julia Haas, Igor Mordatch, Dean Mobbs, Joel Z. Leibo:
Melting Pot 2.0. CoRR abs/2211.13746 (2022) - 2021
- [c9]Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charlie Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. ICML 2021: 6187-6199 - [i12]Eugene Vinitsky, Raphael Köster, John P. Agapiou, Edgar A. Duéñez-Guzmán, Alexander Sasha Vezhnevets, Joel Z. Leibo:
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings. CoRR abs/2106.09012 (2021) - [i11]Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Sasha Vezhnevets, John P. Agapiou, Peter Sunehag, Raphael Koster, Jayd Matyas, Charles Beattie, Igor Mordatch, Thore Graepel:
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot. CoRR abs/2107.06857 (2021) - [i10]Edgar A. Duéñez-Guzmán, Kevin R. McKee, Yiran Mao, Ben Coppin, Silvia Chiappa, Alexander Sasha Vezhnevets, Michiel A. Bakker, Yoram Bachrach, Suzanne Sadedin, William Isaac, Karl Tuyls, Joel Z. Leibo:
Statistical discrimination in learning agents. CoRR abs/2110.11404 (2021) - 2020
- [c8]Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, Karl Tuyls:
Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. AAMAS 2020: 492-501 - [c7]Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo:
Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning. AAMAS 2020: 869-877 - [c6]Yinlam Chow, Ofir Nachum, Aleksandra Faust, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
Safe Policy Learning for Continuous Control. CoRL 2020: 801-821 - [i9]Kevin R. McKee, Ian Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo:
Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning. CoRR abs/2002.02325 (2020) - [i8]Ian Gemp, Kevin R. McKee, Richard Everett, Edgar A. Duéñez-Guzmán, Yoram Bachrach, David Balduzzi, Andrea Tacchetti:
D3C: Reducing the Price of Anarchy in Multi-Agent Learning. CoRR abs/2010.00575 (2020) - [i7]Raphael Köster, Kevin R. McKee, Richard Everett, Laura Weidinger, William S. Isaac, Edward Hughes, Edgar A. Duéñez-Guzmán, Thore Graepel, Matthew M. Botvinick, Joel Z. Leibo:
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences. CoRR abs/2010.09054 (2020) - [i6]Charles Beattie, Thomas Köppe, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
DeepMind Lab2D. CoRR abs/2011.07027 (2020)
2010 – 2019
- 2019
- [c5]Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
Evolving Intrinsic Motivations for Altruistic Behavior. AAMAS 2019: 683-692 - [c4]Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. AAMAS 2019: 1099-1107 - [i5]Yinlam Chow, Ofir Nachum, Aleksandra Faust, Mohammad Ghavamzadeh, Edgar A. Duéñez-Guzmán:
Lyapunov-based Safe Policy Optimization for Continuous Control. CoRR abs/1901.10031 (2019) - 2018
- [c3]Edward Hughes, Joel Z. Leibo, Matthew Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion improves cooperation in intertemporal social dilemmas. NeurIPS 2018: 3330-3340 - [c2]Yinlam Chow, Ofir Nachum, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
A Lyapunov-based Approach to Safe Reinforcement Learning. NeurIPS 2018: 8103-8112 - [i4]Edward Hughes, Joel Z. Leibo, Matthew G. Philips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel:
Inequity aversion resolves intertemporal social dilemmas. CoRR abs/1803.08884 (2018) - [i3]Yinlam Chow, Ofir Nachum, Edgar A. Duéñez-Guzmán, Mohammad Ghavamzadeh:
A Lyapunov-based Approach to Safe Reinforcement Learning. CoRR abs/1805.07708 (2018) - [i2]Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duéñez-Guzmán, Joel Z. Leibo:
Evolving intrinsic motivations for altruistic behavior. CoRR abs/1811.05931 (2018) - [i1]Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel:
Malthusian Reinforcement Learning. CoRR abs/1812.07019 (2018) - 2015
- [j3]Eliseo Ferrante, Ali Emre Turgut, Edgar A. Duéñez-Guzmán, Marco Dorigo, Tom Wenseleers:
Evolution of Self-Organized Task Specialization in Robot Swarms. PLoS Comput. Biol. 11(8) (2015) - 2013
- [j2]Edgar A. Duéñez-Guzmán, Michael D. Vose:
No Free Lunch and Benchmarks. Evol. Comput. 21(2): 293-312 (2013) - [c1]Eliseo Ferrante, Edgar A. Duéñez-Guzmán, Ali Emre Turgut, Tom Wenseleers:
GESwarm: grammatical evolution for the automatic synthesis of collective behaviors in swarm robotics. GECCO 2013: 17-24 - 2011
- [j1]Marte A. Ramírez-Ortegón, Edgar A. Duéñez-Guzmán, Raúl Rojas, Erik Cuevas:
Unsupervised measures for parameter selection of binarization algorithms. Pattern Recognit. 44(3): 491-502 (2011)
Coauthor Index
aka: Alexander Sasha Vezhnevets
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-05 21:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint