The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability
Judah Goldfeder, Victoria Dean, Zixin Jiang, Xuezheng Wang, Bing dong, Hod Lipson, John Sipple
TL;DR
The paper tackles scaling HVAC optimization by introducing The Smart Buildings Control Suite, an open-source benchmark with real-world data (11 buildings over 6 years), a scalable data-driven simulator, and a PINN-based simulator alternative. It models energy and emissions optimization as an MDP with a multi-objective reward, the 3C Reward, enabling offline RL and MPC approaches while remaining compatible with OpenAI Gym and TensorFlow Datasets. The authors demonstrate a fast, calibratable FD-based simulator and a modular PINN that matches real-world dynamics more efficiently than purely data-driven models, and they show that SAC can surpass a baseline controller in a representative building. Overall, the benchmark aims to accelerate scalable, reproducible HVAC control research and facilitate transition from lab to real-world buildings.
Abstract
Commercial buildings account for 17% of U.S. carbon emissions, with roughly half of that from Heating, Ventilation, and Air Conditioning (HVAC). HVAC devices form a complex thermodynamic system, and while Model Predictive Control and Reinforcement Learning have been used to optimize control policies, scaling to thousands of buildings remains a significant unsolved challenge. Most current algorithms are over-optimized for specific buildings and rely on proprietary data or hard-to-configure simulations. We present the Smart Buildings Control Suite, the first open source interactive HVAC control benchmark with a focus on solutions that scale. It consists of 3 components: real-world telemetric data extracted from 11 buildings over 6 years, a lightweight data-driven simulator for each building, and a modular Physically Informed Neural Network (PINN) building model as a simulator alternative. The buildings span a variety of climates, management systems, and sizes, and both the simulator and PINN easily scale to new buildings, ensuring solutions using this benchmark are robust to these factors and only reliant on fully scalable building models. This represents a major step towards scaling HVAC optimization from the lab to buildings everywhere. To facilitate use, our benchmark is compatible with the Gym standard, and our data is part of TensorFlow Datasets.
