Skip to content

Latest commit

 

History

History
118 lines (73 loc) · 6.74 KB

README.md

File metadata and controls

118 lines (73 loc) · 6.74 KB

Reinforcement Learning Examples

medium Python3.8.6 PyTorch1.8.1

Pong environment

Animation

Policy Gradients
Checkpoint weights


Lunar Lander environment

Animation

Deep Q-Network
Checkpoint weights

Policy Gradients
Checkpoint weights


Cartpole environment

Animation

Policy Gradients
Checkpoint weights

Deep Q-Network
Checkpoint weights


Mario environment

Animation


Policy Gradients
Checkpoint weights

Plot of average reward per 10 episodes


Double Deep Q-Network
Checkpoint weights

Plot of average reward per 10 episodes


PPO+GAE
Checkpoint weights

Plot of average reward per 10 episodes


Highway environments

video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

PPO+GAE
Checkpoint weights


PyBullet Walker2D environment

video.mp4

PPO+GAE
Checkpoint weights

Plot of average reward per 50 episodes