Transfer and zero-shot reinforcement learning : learning behaviors without a reward function
Bron
Antwerpen, University of Antwerp, Faculty of Science,xxii, 144 p.
Successor Clusters : A Behavior Basis for Unsupervised Zero-Shot Reinforcement Learning
Bron
Transactions on Machine Learning Research - ISSN 2835-8856-07 (2025) p. 1-33
GPI-tree search : algorithms for decision-time planning with the general policy improvement theorem
Bron
Neural computing and applications - ISSN 0941-0643-37:23 (2025) p. 18989-19007
GPI-tree search : algorithms for decision-time planning with the general policy improvement theorem
Bron
Adaptive and Learning Agents Workshop (ALA), collocated with AAMAS, 29-30 May, 2023, London, UK- () p. 1-8
Deep learning of intrinsically motivated options in the arcade learning environment
Bron
Deep Reinforcement Learning Workshop, NeurIPS 2022, 9 December, 2022- () p. 1-14