Easy21 Implementation

This is an implementation of the Easy21 assignment of David Silver's Reinforcement Learning Course at UCL. The assignment can be found here.

Monte-Carlo Control

python3 mc.py

10 Million Episodes of the game have been evaluated, to obtain the following Value function:

TD Learning

python3 td.py

Mean Squared Error of the state-action function of the Monte-Carlo experiment with different Lambdas. For each lambda, 10 000 Episodes have been evaluated.

Mean Squared Error evolution with different Lambdas.

Linear Function Approximation

python3 lfa.py

The lookup table of the previous experiment is replaced with a linear function approximation. The logic for the feature vector can be found in the assignment.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
figs		figs
Q.dill		Q.dill
README.md		README.md
environment.py		environment.py
lfa.py		lfa.py
mc.py		mc.py
td.py		td.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Easy21 Implementation

Monte-Carlo Control

TD Learning

Linear Function Approximation

About

Releases

Sponsor this project

Packages

Languages

timbmg/easy21-rl

Folders and files

Latest commit

History

Repository files navigation

Easy21 Implementation

Monte-Carlo Control

TD Learning

Linear Function Approximation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages