Profils utilisateurs correspondant à "Marlos C. Machado"

Marlos C. Machado

University of Alberta; Amii
Adresse e-mail validée de ualberta.ca
Cité 3268 fois

Autonomous navigation of stratospheric balloons using reinforcement learning

…, S Candido, PS Castro, J Gong, MC Machado… - Nature, 2020 - nature.com
Efficiently navigating a superpressure balloon in the stratosphere 1 requires the integration
of a multitude of cues, such as wind speed and solar elevation, and the process is …

Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents

MC Machado, MG Bellemare, E Talvitie… - Journal of Artificial …, 2018 - jair.org
The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge
of building AI agents with general competency across dozens of Atari 2600 games. It …

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press
Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

Contrastive behavioral similarity embeddings for generalization in reinforcement learning

R Agarwal, MC Machado, PS Castro… - arXiv preprint arXiv …, 2021 - arxiv.org
Reinforcement learning methods trained on few environments rarely learn policies that
generalize to unseen environments. To improve generalization, we incorporate the inherent …

Count-based exploration with the successor representation

MC Machado, MG Bellemare, M Bowling - Proceedings of the AAAI …, 2020 - ojs.aaai.org
In this paper we introduce a simple approach for exploration in reinforcement learning (RL)
that allows us to develop theoretically justified algorithms in the tabular case but that is also …

Generalization and regularization in dqn

J Farebrother, MC Machado, M Bowling - arXiv preprint arXiv:1810.00123, 2018 - arxiv.org
… Introducing flavours to the ALE is not one of our contributions, this was done by Machado et
al. (2018). … Marlos C. Machado performed part of this work while at the University of Alberta. …

Eigenoption discovery through the deep successor representation

MC Machado, C Rosenbaum, X Guo, M Liu… - arXiv preprint arXiv …, 2017 - arxiv.org
Options in reinforcement learning allow agents to hierarchically decompose a task into
subtasks, having the potential to speed up learning and planning. However, autonomously …

Loss of plasticity in continual deep reinforcement learning

…, J Modayil, A White, MC Machado - … on Lifelong Learning …, 2023 - proceedings.mlr.press
In this paper, we characterize the behavior of canonical value-based deep reinforcement
learning (RL) approaches under varying degrees of non-stationarity. In particular, we …

State of the art control of atari games using shallow reinforcement learning

Y Liang, MC Machado, E Talvitie, M Bowling - arXiv preprint arXiv …, 2015 - arxiv.org
The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of
the first successful combinations of deep neural networks and reinforcement learning. Its …

True online temporal-difference learning

…, AR Mahmood, PM Pilarski, MC Machado… - Journal of Machine …, 2016 - jmlr.org
The temporal-difference methods TD(λ) and Sarsa(λ) form a core part of modern reinforcement
learning. Their appeal comes from their good performance, low computational cost, and …