Transfer and zero-shot reinforcement learning : learning behaviors without a reward function

Bron
Antwerpen, University of Antwerp, Faculty of Science,xxii, 144 p.

Successor Clusters : A Behavior Basis for Unsupervised Zero-Shot Reinforcement Learning

Bron
Transactions on Machine Learning Research - ISSN 2835-8856-07 (2025) p. 1-33
Auteur(s)

GPI-tree search : algorithms for decision-time planning with the general policy improvement theorem

Bron
Adaptive and Learning Agents Workshop (ALA), collocated with AAMAS, 29-30 May, 2023, London, UK- () p. 1-8