Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
reinforcement-learning deep-reinforcement-learning pytorch gym frozenlake-v0 proximal-policy-optimization ppo cartpole-v0 lunar-lander random-network-distillation bipedalwalker ppo-rnd frozenlake-not-slippery
-
Updated
Dec 31, 2020 - Python