L4 TRPO and PPO (Foundations of Deep RL Series)

Pieter Abbeel 36,725 4 years ago

Video Not Working? Fix It Now

Lecture 4 of a 6-lecture series on the Foundations of Deep RL Topic: Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) Instructor: Pieter Abbeel Slides: https://www.dropbox.com/s/bodgpysmm6lu998/l4-TRPO-PPO.pdf?dl=0

Comment