In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning.
Este curso forma parte de Programa especializado: Aprendizaje por refuerzo
Ofrecido Por
Acerca de este Curso
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Habilidades que obtendrás
- Artificial Intelligence (AI)
- Machine Learning
- Reinforcement Learning
- Function Approximation
- Intelligent Systems
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Programa - Qué aprenderás en este curso
Welcome to the Course!
Monte Carlo Methods for Prediction & Control
Temporal Difference Learning Methods for Prediction
Temporal Difference Learning Methods for Control
Planning, Learning & Acting
Reseñas
- 5 stars82 %
- 4 stars13,58 %
- 3 stars2,85 %
- 2 stars0,60 %
- 1 star0,95 %
Principales reseñas sobre SAMPLE-BASED LEARNING METHODS
Overall a very nice course, well explained and presented.
Sometimes, it would be nice to see the slides 'full screen' rather than the small version in the corner.
Good balance of theory and programming assignments. I really like the weekly bonus videos with professors and developers. Recommend to everyone.
It's an important course in understanding the working of reinforcement learning. Although some important and complex topics are not explored in this course which are mentioned in the textbook.
Excellent course that naturally extends the first specialization course. The application examples in programming are very good and I loved how RL gets closer and closer to how a living being thinks.
Acerca de Programa especializado: Aprendizaje por refuerzo

Preguntas Frecuentes
¿Cuándo podré acceder a las lecciones y tareas?
¿Qué recibiré si me suscribo a este Programa especializado?
¿Hay ayuda económica disponible?
¿Tienes más preguntas? Visita el Centro de Ayuda al Estudiante.