Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch, with some explanation
reinforcement-learning
deep-reinforcement-learning
pytorch
gym
frozenlake-v0
proximal-policy-optimization
ppo
cartpole-v0
lunar-lander
random-network-distillation
bipedalwalker
ppo-rnd
frozenlake-not-slippery
Updated Dec 31, 2020 - Python