- 1. Course Introduction
- 2. The K-Armed Bandit Problem
- 3. What to Learn? Estimating Action Values
- 4. Exploration vs. Exploitation Tradeoff
- Programming Assignment: Bandits and Exploration Exploitation
- Quiz
- 1. Introduction to Markov Decision Processes
- 2. Goal of Reinforcement Learning
- 3. Continuing Tasks
- Quiz
- 1. Policies and Value Functions
- 2. Bellman Equations
- 3. Optimality (Optimal Policies & Value Functions)
- Graded Quiz
- Practice Quiz
- 1. Policy Evaluation (Prediction)
- 2. Policy Iteration (Control)
- 3. Generalized Policy Iteration
- 4. Course Wrap-up
- Programming Assignment: Optimal Policies with Dynamic Programming
- Quiz
- 1. Course Introduction
- 2. Introduction to Monte Carlo Methods
- 3. Monte Carlo for Control
- 4. Exploration Methods for Monte Carlo
- 5. Off-policy Learning for Prediction
- Programming Assignment: Blackjack
- Quiz
- 1. Introduction to Temporal Difference Learning
- 2. Advantages of TD
- Assigment
- Quiz
- 1.TD for Control
- 2. Off-policy TD Control: Q-learning
- 3. Expected Sarsa
- Assigment
- Quiz
- 1. What is a Model?
- 2. Planning
- 3. Dyna as a formalism for planning
- 4. Dealing with inaccurate models
- Assigment
- 5. Course Wrap-up
- Quiz
- 1. Course 3 Introduction
- 2. Estimating values functions with supervised learning
- 3. The Objective for On-policy Prediction
- 4. The Object for TD
- 5. Linear TD
- Assigment
- Quiz
- 1. Feature Construction for Linear Methods
- 2. Neural Networks
- 3. Training Neural Networks
- Assigment
- Quiz
- 1. Episodic Sarsa with Function Approximation
- 2. Exploration under Function Approximation
- 3. Average Reward
- Programming Assignment: Function Approximation and Control
- Quiz
- 1. Learning Parameterized Policies
- 2. Policy Gradient for Continuing Tasks
- 3. Actor-Critic for Continuing Tasks
- 4. Policy Parameterizations
- Assigment
- 5. Course Wrap-up
- Quiz
- 1. Course Introduction
- 1. Final Project: Milestone 1
- 2. Project Resources
- Assigment
- 1. Weekly Learning Goals
- 2.Project Resources
- Quiz
- 1. Weekly Learning Goals
- 2. Project Resources
- Quiz
- 1. Weekly Learning Goals
-
2. Project Resources
-
1.Lets Review Optimization Strategies for NNs.mp4
-
2.Lets Review Expected Sarsa with Function Approximation.mp4
-
3.Lets Review Dyna & Q-learning in a Simple Maze.mp4
-
4.Meeting with Martha In-depth on Experience Replay.mp4
-
5.Martin Riedmiller on The Collect and Infer framework for data-efficient RL.mp4
-
- Assigment
- 1. Weekly Learning Goals
- 2. Project Resources
- Assigment
- 3. Congratulations!
2.Policy Iteration.mp4
Views | |
---|---|
0 | Total Views |
0 | Members Views |
0 | Public Views |
Actions | |
---|---|
0 | Likes |
0 | Dislikes |
0 | Comments |
Share by mail
Please login to share this video by email.