GPI-tree search: algorithms for decision-time planning with the general policy improvement theorem

Source
Adaptive and Learning Agents Workshop (ALA), collocated with AAMAS, 29-30 May 2023, London, UK, p. 1-8