Clean PyTorch implementations of imitation and reward learning algorithms
-
Updated
Aug 6, 2024 - Python
Clean PyTorch implementations of imitation and reward learning algorithms
Experiments in applying interpretability techniques to learned reward functions.
Version of the PST for DIVA, implemented in E-Prime.
Add a description, image, and links to the reward-learning topic page so that developers can more easily learn about it.
To associate your repository with the reward-learning topic, visit your repo's landing page and select "manage topics."