MalmoRL

A framework for training Reinforcement Learning agents in Minecraft with Project Malmö. I've built it for my own research and I hope it's useful to others as well. It is partially based on code provided with the The Malmo Collaborative AI Challenge, extended to support more Malmö mission environments. It should also be easy to extend further to support the needs of more environments.

Work in progress...

Define a mission

Create missions/<your_mission>.py. Inside it you must define 3 classses:

Mission, where you should define at least the mission_name the agent_names and the mission_xml description.
MissionEnvironment, where you should define at least the available actions in the environment. You can optionally define several other aspects of the environment, like how you want actions sent by the agent to be handled etc. by overriding the respective methods.
MissionStateBuilder, where you can define the states (frames, observations etc.) produced by the environment. You must override the build() method to create and return states to the agent.

Take a look at the included missions/classroom.py and missions/multi_agent.py for more concrete examples.

Define an agent

New agents should extend BaseAgent and override fit(), test(), save() and load() methods for training, testing, saving and loading the agent respectively. You can look at the included agents in malmo_rl for examples.

Run an experiment

You can look at the included run_classroom.py and run_multi_agent.py for how to make your own script for your custom experiment but you don't necessarily have to follow them. The scripts expect a list of Malmö clients defined in clients.txt. There must be at least as many clients as there are agents in the mission.

Use included agents

malmo_rl includes 3 agents based on my fork of keras-rl:

Random agent
Double Dueling DQN (D-DDQN) with recurrent network support
Deep Deterministic Policy Gradient (DDPG) with recurrent network support

You can run classroom_train_dqn.sh or classroom_train_ddpg.sh to train DQN and DDPG respectively on the Classroom mission. You can also run multi_agent_random.sh to test a mission with 2 random agents and an overhead observer.

Requirements

Python 2.7 or 3.5
Project Malmö
(Optional) keras-rl

Extra

If you want to use the environments shown in the .gifs you can download them here and extract them in your <malmo_dir>/Minecraft/run/saves folder.

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
malmo_rl		malmo_rl
malmopy		malmopy
missions		missions
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
classroom_train_ddpg.sh		classroom_train_ddpg.sh
classroom_train_dqn.sh		classroom_train_dqn.sh
clients.txt		clients.txt
common.py		common.py
dqn_ddpg.png		dqn_ddpg.png
labyrinth_dqn.gif		labyrinth_dqn.gif
mission.py		mission.py
multi_agent_random.sh		multi_agent_random.sh
obstacles_dqn.gif		obstacles_dqn.gif
pools_dqn.gif		pools_dqn.gif
rooms_dqn.gif		rooms_dqn.gif
run_classroom.py		run_classroom.py
run_multi_agent.py		run_multi_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MalmoRL

Define a mission

Define an agent

Run an experiment

Use included agents

Requirements

Extra

About

Releases

Packages

Languages

License

petrosgk/MalmoRL

Folders and files

Latest commit

History

Repository files navigation

MalmoRL

Define a mission

Define an agent

Run an experiment

Use included agents

Requirements

Extra

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages