Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-11T11:53:27.276Z Has data issue: false hasContentIssue false

An environment-adaptive SAC-based HVAC control of single-zone residential and office buildings

Published online by Cambridge University Press:  08 January 2025

Xinlin Wang*
Affiliation:
Energy BU, CSIRO, Newcastle, 2304, NSW, Australia
Nariman Mahdavi
Affiliation:
Energy BU, CSIRO, Newcastle, 2304, NSW, Australia
Subbu Sethuvenkatraman
Affiliation:
Energy BU, CSIRO, Newcastle, 2304, NSW, Australia
Sam West
Affiliation:
Energy BU, CSIRO, Newcastle, 2304, NSW, Australia
*
Corresponding author: Xinlin Wang; Email: xinlin.wang@csiro.au

Abstract

This study introduces an advanced reinforcement learning (RL)-based control strategy for heating, ventilation, and air conditioning (HVAC) systems, employing a soft actor-critic agent with a customized reward mechanism. This strategy integrates time-varying outdoor temperature-dependent weighting factors to dynamically balance thermal comfort and energy efficiency. Our methodology has undergone rigorous evaluation across two distinct test cases within the building optimization testing (BOPTEST) framework, an open-source virtual simulator equipped with standardized key performance indicators (KPIs) for performance assessment. Each test case is strategically selected to represent distinct building typologies, climatic conditions, and HVAC system complexities, ensuring a thorough evaluation of our method across diverse settings. The first test case is a heating-focused scenario in a residential setting. Here, we directly compare our method against four advanced control strategies: an optimized rule-based controller inherently provided by BOPTEST, two sophisticated RL-based strategies leveraging BOPTEST’s KPIs as reward references, and a model predictive control (MPC)-based approach specifically tailored for the test case. Our results indicate that our approach outperforms the rule-based and other RL-based strategies and achieves outcomes comparable to the MPC-based controller. The second scenario, a cooling-dominated environment in an office setting, further validates the versatility of our strategy under varying conditions. The consistent performance of our strategy across both scenarios underscores its potential as a robust tool for smart building management, adaptable to both residential and office environments under different climatic challenges.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Recent work in RL-based HVAC control

Figure 1

Figure 1. Framework of the RL agent interface with the BOPTEST framework via BOPTEST-Gym.

Figure 2

Table 2. Overview of BOPTEST test cases with configurations used in this study

Figure 3

Figure 2. The proposed reward mechanism.

Figure 4

Figure 3. Variation of thermal weight α(t) in response to seasonal outdoor temperature changes.

Figure 5

Table 3. Configuration of hyperparameters in our method and benchmark approaches

Figure 6

Figure 4. Simulation results for Test Case 1: Heating scenario during cold days.

Figure 7

Figure 5. Benchmark cumulative KPIs across the 2 weeks heating scenario in test case 1.

Figure 8

Figure 6. Simulation results for Test Case 2: Cooling dominated scenario.

Figure 9

Figure 7. Temperature Profiles and Variability: Comparative Analysis of two test cases.

Figure 10

Figure 8. Comparison of dynamic thermal weight $ \alpha (t) $versus fixed thermal weight $ \alpha = 0.5 $.

Figure 11

Figure 9. $ \delta $ sensitivity analysis.

Submit a response

Comments

No Comments have been published for this article.