Implementation of Proximal Policy Optimization (PPO) using the reinforcement learning framework RLKit by vitchyr. Installation for RLKit is specified in the original README for RLKit.
- Proximal Policy Optimization (PPO)
- Example script
- Paper
- Other References
Run the following command:
python examples/ppo.py
Here is an example implementation result on the OpenAI Gym environment, Bipedal Walker-v2: