Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial

Machine Learning with Phil 76,341 lượt xem 4 years ago

Video Not Working? Fix It Now

Proximal Policy Optimization is an advanced actor critic algorithm designed to improve performance by constraining updates to our actor network. It's relatively straight forward to implement in code, and in this full tutorial you're going to get a mini lecture covering the essential concepts behind the ppo algorithm, as well as a complete implementation in the pytorch framework. We'll test our algorithm in a simple open ai gym environment: the cartpole.

Code for this video is here:
https://github.com/philtabor/Youtube-Code-Repository/tree/master/ReinforcementLearning/PolicyGradient/PPO/torch

A written crash course in PPO can be found here:
https://www.neuralnet.ai/a-crash-course-in-proximal-policy-optimization/

Learn how to turn deep reinforcement learning papers into code:

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.

Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai

https://www.neuralnet.ai/courses

Or, pickup my Udemy courses here:

Deep Q Learning:
https://www.udemy.com/course/deep-q-learning-from-paper-to-code/?couponCode=DQN-JUNE-22

Actor Critic Methods:
https://www.udemy.com/course/actor-critic-methods-from-paper-to-code-with-pytorch/?couponCode=AC-JUNE-22

Curiosity Driven Deep Reinforcement Learning
https://www.udemy.com/course/curiosity-driven-deep-reinforcement-learning/?couponCode=ICM-JUNE-22

Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-language-processing-from-first-principles/?couponCode=NLP-JUNE-22
Reinforcement Learning Fundamentals
https://www.manning.com/livevideo/reinforcement-learning-in-motion

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql

Come hang out on Discord here:
https://discord.gg/Zr4VCdv

Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai

Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: https://twitter.com/MLWithPhil

proximal policy optimization

proximal policy optimization tutorial

proximal policy optimization algorithm

proximal policy optimization python

proximal policy optimization explained

proximal policy optimization example

ppo open ai gym

ppo example

ppo pytorch

proximal policy optimization pytorch

actor critic reinforcement learning

openai gym cartpole tutorial

open ai gym tutorial

ppo discrete action space

ppo cartpole

Comment