Reinforcement Learning algorithms SARSA, Q-Learning, DQN, for Classical and MuJoCo Environments and testing them with OpenAI Gym.
-
Updated
Aug 1, 2018 - Jupyter Notebook
Reinforcement Learning algorithms SARSA, Q-Learning, DQN, for Classical and MuJoCo Environments and testing them with OpenAI Gym.
Tic-Tac-Toe agent trained by Deep Reinforcement Learning
Reinforcement Learning tool to show how λ and learning rate affect Q(λ). Demonstrates this through teaching a bot to solve a maze.
Repository of Reinforcement Learning projects done during the course @sapienza
Code for my paper: "Theta sequences as eligibility traces: a biological solution to credit assignment"
Distributed Deep Reinforcement Learning Framework
Replicates the Random Walk Experiments from Sutton's 1998 paper "Learning to predict by the methods of Temporal Differences"
Second homework for the Reinforcement Learning course
Machine-learning application in path finding using the n-step TD(lambda) algorithm
🤖 Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.
Add a description, image, and links to the td-lambda topic page so that developers can more easily learn about it.
To associate your repository with the td-lambda topic, visit your repo's landing page and select "manage topics."