TheSequence

TheSequence

Share this post

TheSequence
TheSequence
🌐 Edge#99: What are Trust Region and Proximal Policy Optimization; PPO to master Dota2; and RLlib

🌐 Edge#99: What are Trust Region and Proximal Policy Optimization; PPO to master Dota2; and RLlib

RL-series goes on

Jun 22, 2021
βˆ™ Paid
4

Share this post

TheSequence
TheSequence
🌐 Edge#99: What are Trust Region and Proximal Policy Optimization; PPO to master Dota2; and RLlib
Share

In this issue:

  • we discuss what are trust region and proximal policy optimization;Β 

  • we explore RLlib – an open-source framework for highly scalable reinforcement learning;

  • we learn how OpenAI used PPO reinforcement learning to master Dota 2. Β 

Give a gift subscription

πŸ’‘Β ML Concept of the Day: What are Trust Region and Proximal Policy Optimization?

In Edge#97, we continue our series…

This post is for paid subscribers

Already a paid subscriber? Sign in
Β© 2025 Jesus Rodriguez
Privacy βˆ™ Terms βˆ™ Collection notice
Start writingGet the app
Substack is the home for great culture

Share