PyTorch implementation of the CQL algorithm (Paper). Including the discrete action space DQN-CQL version, the continuous action space SAC-CQL version and a discrete CQL-SAC implementation.
-> conda environment [ ] -> requirement.txt [ ]
Select the folder [CQL-DQN, CQL-SAC, CQL-SAC-discrete] of the algorithm you want to train and run: python train.py
Comparison of a discrete CQL-SAC implementations vs the normal discrete SAC.
Find all training results and hyperparameter in the wandb project.
- update readme [ ]
- add distributional Q-Function [ ]
Im open for feedback, found bugs, improvements or anything. Just leave me a message or contact me.
- Sebastian Dittert
Feel free to use this code for your own projects or research.
@misc{SAC,
author = {Dittert, Sebastian},
title = {CQL},
year = {2021},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/BY571/CQL}},
}