Lecture 4 of a 6-lecture series on the Foundations of Deep RL Topic: Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) Instructor: Pieter Abbeel Slides: https://www.dropbox.com/s/bodgpysmm6lu998/l4-TRPO-PPO.pdf?dl=0