Publication
Title
Learning intrinsically motivated options to stimulate policy exploration
Author
Abstract
A Reinforcement Learning (RL) agent needs to find an optimal sequence of actions in order to maximize rewards. This requires consistent exploration of states and action sequences to ensure the policy found is optimal. One way to motivate exploration is through intrinsic rewards, i.e. agent-induced rewards to guide itself towards interesting behaviors. We propose to learn from such intrinsic rewards through exploration options, i.e. additional temporally-extended actions to call separate policies (or "Explorer" agents) associated to an intrinsic reward. We show that this method has several key advantages over the usual method of weighted sum of rewards, mainly task-transfer abilities and scalability to multiple reward functions.
Language
English
Source (book)
4th Lifelong Learning Workshop at ICML 2020, the 37th International Conference on Machine Learning, 18 July, 2020, Vienna Austria
Publication
2020
Volume/pages
p. 1-11
Full text (publisher's version - intranet only)
UAntwerpen
Faculty/Department
Research group
Publication type
Subject
Affiliation
Publications with a UAntwerp address
External links
Source file
Record
Identifier
Creation 26.11.2020
Last edited 17.06.2024
To cite this reference