Browsing Computer Science - Undergraduate degree by Subject "On-line planning"
Now showing items 1-1 of 1
-
O-MuZero : abstract planning models Induced by Options on the MuZero Algorithm
(2021) [Work completion of graduation]Training Reinforcement Learning agents that learn both the value function and the envi ronment model can be a very time consuming method, one of the main reasons for that is that these agents learn by actions one step at ...