Policy and value iteration

Loading...
Del curso dictado por National Research University Higher School of Economics
Practical Reinforcement Learning
53 calificaciones
National Research University Higher School of Economics
53 calificaciones
Curso 4 de 7 en SpecializationAdvanced Machine Learning
De la lección
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

Conoce a los instructores

  • Pavel Shvechikov
    Pavel Shvechikov
    Researcher at HSE and Sberbank AI Lab
    HSE Faculty of Computer Science
  • Alexander Panin
    Alexander Panin
    Lecturer
    HSE Faculty of Computer Science

Explora nuestro catálogo

Inscríbete de manera gratuita y obtén recomendaciones personalizadas, actualizaciones y ofertas.