Successor Clusters : A Behavior Basis for Unsupervised Zero-Shot Reinforcement Learning

Bron
Transactions on Machine Learning Research - ISSN 2835-8856-07 (2025) p. 1-33
Auteur(s)

GPI-tree search : algorithms for decision-time planning with the general policy improvement theorem

Bron
Adaptive and Learning Agents Workshop (ALA), collocated with AAMAS, 29-30 May, 2023, London, UK- () p. 1-8

A case for feature-based successor features for transfer in reinforcement learning

Bron
34th Benelux Conference on Artificial Intelligence and the 31 Belgium Dutch Conference on Machine Learning (BNAIC/BENELEARN 2022), 7-9 November, 2022, Mechelen, Belgium- () p. 1-16