Model-Based Reinforcement Learning Approach to HVAC Optimal Control

Repo structure

Multi-objective MDP formulation with thermal comfort and energy consumption as the objectives
Lagrangian dual reinforcement learning approach
Fine-tuning still to be done
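
As a rough illustration of the Lagrangian dual idea, the sketch below folds a comfort-violation signal into the reward with a multiplier and updates that multiplier by projected dual ascent. This is a minimal sketch under assumptions, not the code in this repo: the function names, the `budget` threshold, and the learning rate are all hypothetical.

```python
def lagrangian_reward(energy_kwh, comfort_violation, lam):
    """Scalarized reward: penalize energy use plus lam-weighted comfort violation.

    All arguments are assumptions for illustration; the repo may define
    its objectives differently.
    """
    return -energy_kwh - lam * comfort_violation


def update_multiplier(lam, mean_violation, budget, lr=0.05):
    """Projected dual ascent on the multiplier.

    lam grows while the average comfort violation exceeds the (hypothetical)
    budget, shrinks otherwise, and is clipped at zero.
    """
    return max(0.0, lam + lr * (mean_violation - budget))


if __name__ == "__main__":
    lam = 0.0
    # Toy per-episode average violations, made up for the example.
    for mean_violation in [2.0, 1.5, 0.8, 0.3]:
        lam = update_multiplier(lam, mean_violation, budget=0.5)
        print(round(lam, 4), lagrangian_reward(3.0, mean_violation, lam))
```

The agent is trained on `lagrangian_reward` while the multiplier is updated on a slower timescale, so the comfort weight is learned rather than hand-tuned.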

Single-objective MDP minimizing energy consumption, with thermal comfort enforced through a hard constraint
Action-bound approach
In progress: inferring changes in the environment to adjust the mask accordingly
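
One way to picture the action-bound approach with a discrete-action agent such as a DQN: build a boolean mask over candidate setpoints and pick the greedy action only among the allowed ones. The sketch below assumes the actions are temperature setpoints and that the comfort constraint is a simple band; both are assumptions for illustration, and the helper names are hypothetical.

```python
def comfort_mask(setpoints_c, comfort_low_c, comfort_high_c):
    """Return True for each setpoint inside the (assumed) comfort band."""
    return [comfort_low_c <= sp <= comfort_high_c for sp in setpoints_c]


def masked_greedy_action(q_values, mask):
    """Greedy action restricted to the allowed indices.

    Disallowed actions are never selected, which is what makes the
    comfort constraint hard rather than a penalty.
    """
    allowed = [i for i, ok in enumerate(mask) if ok]
    if not allowed:
        raise ValueError("mask excludes every action")
    return max(allowed, key=lambda i: q_values[i])


if __name__ == "__main__":
    setpoints = [18.0, 20.0, 22.0, 24.0, 26.0]  # hypothetical action set (deg C)
    q_values = [5.0, 1.0, 3.0, 2.0, 9.0]        # toy Q-values
    mask = comfort_mask(setpoints, 20.0, 24.0)
    action = masked_greedy_action(q_values, mask)
    print(setpoints[action])
```

Adjusting the mask when the environment changes then amounts to recomputing the band limits passed to `comfort_mask`.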

Single-objective reinforcement learning formulation (electricity cost) with demand response
Demand response based on Toronto Hydro electricity ToU (time-of-use) rates
CAPS action smoothing is used to reduce HVAC actuation load
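
The two ingredients above can be sketched as follows: a cost-based reward that looks up a time-of-use price by hour, and a CAPS-style smoothness penalty with a temporal term (consecutive states) and a spatial term (a slightly perturbed state). Everything here is an assumption for illustration. The schedule uses placeholder periods and arbitrary price units, not Toronto Hydro's actual rates, and the policy is a toy function rather than the agent in this repo.

```python
import math
import random

# Placeholder schedule: (start_hour, end_hour, price in arbitrary units).
# These are NOT real Toronto Hydro periods or rates.
TOU_SCHEDULE = [
    (0, 7, 1.0),    # off-peak
    (7, 11, 1.5),   # mid-peak
    (11, 17, 2.0),  # on-peak
    (17, 19, 1.5),  # mid-peak
    (19, 24, 1.0),  # off-peak
]


def tou_price(hour):
    """Price for an hour of day in [0, 24)."""
    for start, end, price in TOU_SCHEDULE:
        if start <= hour < end:
            return price
    raise ValueError("hour must be in [0, 24)")


def cost_reward(energy_kwh, hour):
    """Negative electricity cost for one step."""
    return -tou_price(hour) * energy_kwh


def caps_penalty(policy, state, next_state,
                 lam_t=1.0, lam_s=1.0, sigma=0.05, rng=random):
    """Smoothness regularizer to add to the policy loss.

    temporal: distance between actions at consecutive states
    spatial:  distance between actions at the state and a noisy copy of it
    """
    action = policy(state)
    temporal = math.dist(action, policy(next_state))
    noisy_state = [x + rng.gauss(0.0, sigma) for x in state]
    spatial = math.dist(action, policy(noisy_state))
    return lam_t * temporal + lam_s * spatial


if __name__ == "__main__":
    toy_policy = lambda s: [2.0 * s[0]]  # hypothetical deterministic policy
    print(cost_reward(2.0, 12))
    print(caps_penalty(toy_policy, [1.0], [1.5]))
```

Shifting load away from high-price hours raises `cost_reward`, while the penalty discourages rapid changes in the action that would increase actuator wear.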

Top-level files and directories:
agent
base
mask-agent
.gitignore
GHTIn.idf
README.md
dqn.py
in.idf
info.txt
noisy_dqn.py
weather.epw