Diffused Task-Agnostic Milestone Planner

This is the official GitHub repository for the paper:

Mineui Hong, Minjae Kang, and Songhwai Oh, "Diffused Task-Agnostic Milestone Planner," in Proc. of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), Dec. 2023.

How to run experiments

1. Requirements

Please note that installation of the D4RL environments and the CALVIN benchmark is not covered by requirements.txt. We recommend installing the D4RL environments from the D4RL repo and the CALVIN benchmark from the CALVIN repo. Installing both in the same virtual environment might cause dependency conflicts, so we recommend creating two separate virtual environments, one for the D4RL experiments and one for the CALVIN experiments. Also note that the CALVIN experiments use the dataset provided by the TACO-RL repo, which has a slightly different training/validation split.
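As a minimal sketch of the two-environment setup recommended above, the following uses Python's standard `venv` module. The environment names `dtamp-d4rl` and `dtamp-calvin` and the helper `create_experiment_envs` are hypothetical and not part of this repository; a conda-based setup would serve equally well.

```python
import venv
from pathlib import Path


def create_experiment_envs(base_dir, names=("dtamp-d4rl", "dtamp-calvin"), with_pip=True):
    """Create one isolated virtual environment per experiment suite.

    The environment names are placeholders chosen for illustration.
    """
    env_dirs = []
    for name in names:
        env_dir = Path(base_dir) / name
        # venv.create builds a fresh environment; with_pip=True bootstraps pip into it.
        venv.create(env_dir, with_pip=with_pip)
        env_dirs.append(env_dir)
    return env_dirs
```

After creating the environments, activate each one in turn, install requirements.txt, and then install the matching benchmark by following the instructions in its own repo.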

2. Training

Before running the scripts, set the PYTHONPATH environment variable so that it includes the path to this repository:

export PYTHONPATH=$PYTHONPATH:/{path}/{to}/{dtamp}
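If imports fail when launching a script, a quick check like the one below can confirm whether the export took effect. This is only a sketch: `repo_on_pythonpath` is a hypothetical helper, not part of the codebase, and it assumes you pass the same repository path that you exported.

```python
import os
from pathlib import Path


def repo_on_pythonpath(repo_root, pythonpath):
    """Return True if repo_root is one of the entries of a PYTHONPATH string."""
    target = Path(repo_root).resolve()
    # PYTHONPATH entries are separated by os.pathsep (":" on Linux and macOS).
    entries = [entry for entry in pythonpath.split(os.pathsep) if entry]
    return any(Path(entry).resolve() == target for entry in entries)


if __name__ == "__main__":
    # Assumption: this is run from the repository root; pass another path otherwise.
    found = repo_on_pythonpath(os.getcwd(), os.environ.get("PYTHONPATH", ""))
    print("repository on PYTHONPATH:", found)
```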

To train DTAMP for the D4RL environments, run:

python scripts/d4rl/train_dtamp.py --env {env_name}

To train DTAMP for the CALVIN benchmark, first preprocess the dataset with preprocess_calvin_data.py:

python scripts/calvin/preprocess_calvin_data.py --source_data_dir {where}/{tacorl_data}/{saved} --target_data_dir {where}/{to_save}/{processed_data}

Then train the PlayLMP model to learn skill representations, and add the learned skills to the dataset:

python scripts/calvin/train_lmp.py --data_dir {where}/{processed_data}/{saved}
python scripts/calvin/add_skills_to_calvin_dataset.py --data_dir {where}/{processed_data}/{saved}

Now you can finally train DTAMP:

python scripts/calvin/train_dtamp.py --data_dir {where}/{processed_data}/{saved} --lmp_dir {where}/{lmp_checkpoint}/{saved}
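The CALVIN training steps above must run in a fixed order, and each step depends on the output of the previous one. The sketch below chains them and stops at the first failure. The script paths and flags are the ones listed in this README; the helper names and the directory arguments are hypothetical, and it assumes the processed data directory written by the preprocessing step is the one passed as `--data_dir` afterwards.

```python
import subprocess
import sys


def calvin_training_commands(raw_dir, processed_dir, lmp_dir):
    """Build the four CALVIN training commands in the order given in this README."""
    py = sys.executable
    return [
        [py, "scripts/calvin/preprocess_calvin_data.py",
         "--source_data_dir", raw_dir, "--target_data_dir", processed_dir],
        [py, "scripts/calvin/train_lmp.py", "--data_dir", processed_dir],
        [py, "scripts/calvin/add_skills_to_calvin_dataset.py", "--data_dir", processed_dir],
        [py, "scripts/calvin/train_dtamp.py",
         "--data_dir", processed_dir, "--lmp_dir", lmp_dir],
    ]


def run_in_order(commands, runner=subprocess.run):
    """Run each command in turn; check=True raises on the first non-zero exit."""
    for cmd in commands:
        runner(cmd, check=True)
```

Calling `run_in_order(calvin_training_commands(raw_dir, processed_dir, lmp_dir))` from the repository root would execute the whole pipeline. Passing a custom `runner` makes it possible to print the commands as a dry run before launching a long training job.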

3. Evaluation

To evaluate a trained model on the D4RL environments, run:

python scripts/d4rl/evaluate_dtamp.py --env {env_name} --checkpoint_dir {checkpoint}/{dir}

To evaluate a trained model on the CALVIN benchmark, run:

python scripts/calvin/evaluate_dtamp.py --calvin_dir {calvin_env}/{root}/{pth} --data_dir {where}/{data}/{saved} --checkpoint_dir {checkpoint}/{dir} --tasks_per_rollout {1 or 2 or 3}
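Since `--tasks_per_rollout` accepts 1, 2, or 3, a full CALVIN evaluation typically means three runs. The following sketch builds one command per setting using only the flags shown above; `calvin_eval_commands` and its arguments are hypothetical names introduced for illustration.

```python
import sys

ALLOWED_TASKS_PER_ROLLOUT = (1, 2, 3)


def calvin_eval_commands(calvin_dir, data_dir, checkpoint_dir, tasks=ALLOWED_TASKS_PER_ROLLOUT):
    """Build one CALVIN evaluation command per tasks_per_rollout setting."""
    commands = []
    for n in tasks:
        if n not in ALLOWED_TASKS_PER_ROLLOUT:
            raise ValueError(f"tasks_per_rollout must be 1, 2, or 3, got {n}")
        commands.append([
            sys.executable, "scripts/calvin/evaluate_dtamp.py",
            "--calvin_dir", calvin_dir,
            "--data_dir", data_dir,
            "--checkpoint_dir", checkpoint_dir,
            "--tasks_per_rollout", str(n),
        ])
    return commands
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root, inside the virtual environment set up for the CALVIN experiments.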

Reference

@inproceedings{hong2023dtamp,
author={Mineui Hong and Minjae Kang and Songhwai Oh},
title={Diffused Task-Agnostic Milestone Planner},
booktitle={Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS)},
year={2023}
}

Contact

If you have any problems, please contact mineui.hong@rllab.snu.ac.kr.

Acknowledgements

The diffusion model code is based on the decision-diffuser repo and the diffuser repo.
