NeurIPS 2024
Fan-Yun Sun, Harini S I, Alex Zook, Jonathan Tremblay, Logan Cross, Jiajun Wu, Nick Haber
Refer to the README under the directory factorsim
.
$ cd factorsim
$ ./go.sh GAME_NAME
Refer to rl_training/rl_train.sh.
To train RL policies on the PLE environments ("ground-truth" environments used in the paper), run
$ cd rl_training
$ ./rl_train.sh pong ppo gt --train_on_ple
To export video trajectory of a policy
$ cd rl_training
$ python -m utils.export_video pong