Book Content
chapters • 6h8m total length
1. Up and Running with Reinforcement Learning
2. Temporal Difference, SARSA, and Q-Learning
3. Deep Q-Network
4. Double DQN, Dueling Architectures, and Rainbow
5. Deep Deterministic Policy Gradient
6. Asynchronous Methods - A3C and A2C
7. Trust Region Policy Optimization and Proximal Policy Optimization
8. Deep RL Applied to Autonomous Driving














