TheSequence

🌐 Edge#99: What are Trust Region and Proximal Policy Optimization; PPO to master Dota2; and RLlib

RL-series goes on

Jun 22, 2021
∙ Paid

In this issue:

  • we discuss what trust region and proximal policy optimization are;

  • we explore RLlib – an open-source framework for highly scalable reinforcement learning;

  • we learn how OpenAI used PPO reinforcement learning to master Dota 2.  

💡 ML Concept of the Day: What are Trust Region and Proximal Policy Optimization?

In Edge#97, we continue our series…

This post is for paid subscribers

© 2023 Jesus Rodriguez