Reinforcement Learning-based Parameter Control for Evolutionary and Swarm-based Algorithms

This repository stores the implementation of the parameter control method proposed in the paper "Distributed Reinforcement Learning forOut-of-the-box Parameter Control in Evolutionaryand Swarm-based Algorithms" submitted to the IEEE Transactions on Evolutionary Computation, still waiting for approval.

These are the softwares and their respective versions that I used in the experiments to produce the paper:

Python 3.6.6;
GCC 7.3.0;
OpenMPI 3.1.1;
Ray 0.8.4 (RLLib);
Tensorflow 2.1.0;
Numpy1.18.1;
Scipy 1.4.1;
gym 0.15.4.

The proposed controller was implemented using the library Ray, where distributed implementations of the TD3 and PBT algorithms are available. The metaheuristics were implemented manually and are available in the file metaheuristics.py. The file metaheuristic_environment.py implements the gym.Env interface so that it can be used as environment for RL algorithms implemented in the library Ray. The file trainable_wrapper.py implements the interface ray.tune.Trainable so that it can be used as trainable by the PBT algorithm implemented in the module ray.tune. The file run_rl.py starts an experiment passing the metaheuristic, the RL algorithms, the optimization problem and their parameters. The parameters of the TD3 and ApexDDPG are hardcoded (it will be different in the future). You can use this file as an example of an experiment so that you can build yours, or call it as the following example:

python run_rl.py "127.0.0.1:6379" td3.TD3Trainer HCLPSO hclpso_config.json cec17_config/cec17_func1_10dim.json CEC17 16 2 td3_hclpso_func1 validation_data 1

It is important to mention that it assumes that there is a Ray's head node started running locally in the port 6379. You will find an example of the command to start such a node:

ray start --block --head --redis-port=6379 --redis-password="123456" --memory=20000000000 --object-store-memory=20000000000 --num-cpus=48 &

Also, it is necessary to build the file cec17_test_func.c as follows (if you want to use any of the CEC17 benchmark functions as the optimization problem):

gcc -fPIC -shared -lm -o cec17_test_func.so cec17_test_func.c

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
cec17_config		cec17_config
knapsack_config		knapsack_config
knapsack_problem		knapsack_problem
tsp_config		tsp_config
vlsi_tsp/mar_tsp		vlsi_tsp/mar_tsp
LICENSE		LICENSE
README.md		README.md
aco_tsp_config.json		aco_tsp_config.json
binary_ga_config.json		binary_ga_config.json
cec17.py		cec17.py
cec17_test_func.c		cec17_test_func.c
de_config.json		de_config.json
fss_config.json		fss_config.json
hclpso_config.json		hclpso_config.json
instances_description.ods		instances_description.ods
knapsack.py		knapsack.py
metaheuristic_environment.py		metaheuristic_environment.py
metaheuristics.py		metaheuristics.py
run_rl.py		run_rl.py
trainable_wrapper.py		trainable_wrapper.py
tsp.py		tsp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning-based Parameter Control for Evolutionary and Swarm-based Algorithms

About

Releases

Packages

Languages

License

lacerdamarcelo/rl_based_parameter_control_ea_si

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning-based Parameter Control for Evolutionary and Swarm-based Algorithms

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages