Burden, John ORCID: https://orcid.org/0000-0001-7526-0753 (2020) Automating abstraction for potential-based reward shaping. PhD thesis, University of York.
Abstract
Within the field of Reinforcement Learning (RL) the successful application of abstraction can play a huge role in decreasing the time required for agents to learn competent policies. Many examples of this speed-up have been observed throughout the literature. Reward Shaping is one such technique for utilising abstractions in this way. This thesis focuses on how an agent can learn its own abstractions from its own experiences to be used for Potential Based Reward Shaping. As the thesis progresses, the environments for which the abstraction construction is automated grow in complexity and scope --- while also utilising less external knowledge of the domains. This culminates in the approaches \textit{Uniform Property State Abstraction} (UPSA) and \textit{Latent Property State Abstraction} (LPSA), which can both augment existing RL algorithms and allow them to construct abstractions from their own experience and then effectively make use of these abstractions to improve convergence time. Empirical results from this thesis demonstrate that this approach can outperform existing deep RL algorithms such as Deep Q-Networks over a range of domains.
Metadata
Supervisors: | Victoria, Hodge and Daniel, Kudenko and James, Cussens and Rob, Alexander |
---|---|
Keywords: | Reinforcement Learning; Reward Shaping; Abstraction |
Awarding institution: | University of York |
Academic Units: | The University of York > Computer Science (York) |
Identification Number/EthosID: | uk.bl.ethos.834123 |
Depositing User: | Mr John Burden |
Date Deposited: | 14 Jul 2021 09:56 |
Last Modified: | 21 Aug 2021 09:53 |
Download
Examined Thesis (PDF)
Filename: PhDThesis.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.