Listar Tesinas de Curso de Grado por autor "Jacobi, Otavio Flores"
Mostrando ítems 1-1 de 1
-
O-MuZero : abstract planning models Induced by Options on the MuZero Algorithm
Jacobi, Otavio Flores (2021) [Tesinas de grado]Training Reinforcement Learning agents that learn both the value function and the envi ronment model can be a very time consuming method, one of the main reasons for that is that these agents learn by actions one step at ...