Reiforcement learning

19 Abril 2021, 16:30 Luís Miguel Parreira e Correia

Reinforcement learning; dynamic programming; Q-learning; Stochastic Q-learning