Reward design

Loading...
Ver programa

Reseñas

4.2 (416 calificaciones)
  • 5 stars
    57.69%
  • 4 stars
    23.31%
  • 3 stars
    9.13%
  • 2 stars
    4.32%
  • 1 star
    5.52%
LJ
6 de oct. de 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

IZ
6 de jul. de 2020

Well paced lectures and exercises, clear explanations and fun programming tasks. I hope to use some of these tools in the real world.

De la lección
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

Impartido por:

  • Placeholder

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Placeholder

    Alexander Panin

    Lecturer

Explora nuestro catálogo

Inscríbete de manera gratuita y obtén recomendaciones personalizadas, actualizaciones y ofertas.