Reward design

video-placeholder
Loading...
Ver programa

Reseñas

4.2 (443 calificaciones)

  • 5 stars
    58,23 %
  • 4 stars
    23,25 %
  • 3 stars
    8,80 %
  • 2 stars
    4,06 %
  • 1 star
    5,64 %

SF

8 de abr. de 2020

Filled StarFilled StarFilled StarFilled StarFilled Star

At times it felt like a bit more video material would be helpful to better understand the subject/gain deeper understanding.\n\nAnd fixing some of the notebooks would be helpful.

FZ

13 de feb. de 2019

Filled StarFilled StarFilled StarFilled StarFilled Star

A great course with very practical assignments to help you learn how to implement RL algorithms. But it also has some stupid quiz questions which makes you feel confusing.

De la lección

At the heart of RL: Dynamic Programming

This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

Impartido por:

  • Placeholder

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab

  • Placeholder

    Alexander Panin

    Lecturer

Explora nuestro catálogo

Inscríbete de manera gratuita y obtén recomendaciones personalizadas, actualizaciones y ofertas.