Google Scholar

Profils utilisateurs correspondant à "Marlos C. Machado"

Marlos C. Machado

University of Alberta; Amii

Adresse e-mail validée de ualberta.ca

Cité 3268 fois

Autonomous navigation of stratospheric balloons using reinforcement learning

…, S Candido, PS Castro, J Gong, MC Machado… - Nature, 2020 - nature.com

Efficiently navigating a superpressure balloon in the stratosphere 1 requires the integration
of a multitude of cues, such as wind speed and solar elevation, and the process is …

Enregistrer Citer Cité 421 fois Autres articles Les 7 versions

[PDF] jair.org

Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents

MC Machado, MG Bellemare, E Talvitie… - Journal of Artificial …, 2018 - jair.org

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge
of building AI agents with general competency across dozens of Atari 2600 games. It …

Enregistrer Citer Cité 667 fois Autres articles Les 14 versions Version HTML

[PDF] mlr.press

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press

Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

Enregistrer Citer Cité 322 fois Autres articles Les 9 versions Version HTML

[PDF] arxiv.org

Contrastive behavioral similarity embeddings for generalization in reinforcement learning

R Agarwal, MC Machado, PS Castro… - arXiv preprint arXiv …, 2021 - arxiv.org

Reinforcement learning methods trained on few environments rarely learn policies that
generalize to unseen environments. To improve generalization, we incorporate the inherent …

Enregistrer Citer Cité 207 fois Autres articles Les 11 versions Version HTML

[PDF] aaai.org

Count-based exploration with the successor representation

MC Machado, MG Bellemare, M Bowling - Proceedings of the AAAI …, 2020 - ojs.aaai.org

In this paper we introduce a simple approach for exploration in reinforcement learning (RL)
that allows us to develop theoretically justified algorithms in the tabular case but that is also …

Enregistrer Citer Cité 224 fois Autres articles Les 10 versions Version HTML

[PDF] arxiv.org

Generalization and regularization in dqn

J Farebrother, MC Machado, M Bowling - arXiv preprint arXiv:1810.00123, 2018 - arxiv.org

… Introducing flavours to the ALE is not one of our contributions, this was done by Machado et
al. (2018). … Marlos C. Machado performed part of this work while at the University of Alberta. …

Enregistrer Citer Cité 204 fois Autres articles Les 5 versions Version HTML

[PDF] arxiv.org

Eigenoption discovery through the deep successor representation

MC Machado, C Rosenbaum, X Guo, M Liu… - arXiv preprint arXiv …, 2017 - arxiv.org

Options in reinforcement learning allow agents to hierarchically decompose a task into
subtasks, having the potential to speed up learning and planning. However, autonomously …

Enregistrer Citer Cité 177 fois Autres articles Les 5 versions Version HTML

[PDF] mlr.press

Loss of plasticity in continual deep reinforcement learning

…, J Modayil, A White, MC Machado - … on Lifelong Learning …, 2023 - proceedings.mlr.press

In this paper, we characterize the behavior of canonical value-based deep reinforcement
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …

Enregistrer Citer Cité 72 fois Autres articles Les 4 versions Version HTML

[PDF] arxiv.org

State of the art control of atari games using shallow reinforcement learning

Y Liang, MC Machado, E Talvitie, M Bowling - arXiv preprint arXiv …, 2015 - arxiv.org

The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of
the first successful combinations of deep neural networks and reinforcement learning. Its …

Enregistrer Citer Cité 148 fois Autres articles Les 10 versions Version HTML

[PDF] jmlr.org

True online temporal-difference learning

…, AR Mahmood, PM Pilarski, MC Machado… - Journal of Machine …, 2016 - jmlr.org

The temporal-difference methods TD(λ) and Sarsa(λ) form a core part of modern reinforcement
learning. Their appeal comes from their good performance, low computational cost, and …

Enregistrer Citer Cité 121 fois Autres articles Les 10 versions Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Profils utilisateurs correspondant à "Marlos C. Machado"

Marlos C. Machado

Autonomous navigation of stratospheric balloons using reinforcement learning

Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents

A laplacian framework for option discovery in reinforcement learning

Contrastive behavioral similarity embeddings for generalization in reinforcement learning

Count-based exploration with the successor representation

Generalization and regularization in dqn

Eigenoption discovery through the deep successor representation

Loss of plasticity in continual deep reinforcement learning

State of the art control of atari games using shallow reinforcement learning

True online temporal-difference learning