Reinforcement learning
15 Maio 2023, 16:30 • Luís Miguel Parreira e Correia
Reinforcement learning
Bellman's dynamic programming
Q-learning
Exploitation v. exploration
15 Maio 2023, 16:30 • Luís Miguel Parreira e Correia