Hostname: page-component-77f85d65b8-pkds5 Total loading time: 0 Render date: 2026-03-28T17:15:02.628Z Has data issue: false hasContentIssue false

Office-in-the-Loop: an investigation into Agentic AI for advanced building HVAC control systems

Published online by Cambridge University Press:  27 June 2025

Tomoya Sawada*
Affiliation:
DX Innovation Center, Mitsubishi Electric Corporation , Yokohama, Japan
Masahiro Mizuno
Affiliation:
DX Innovation Center, Mitsubishi Electric Corporation , Yokohama, Japan
Takaomi Hasegawa
Affiliation:
Matsuo Institute, Inc, Tokyo, Japan
Keiichi Yokoyama
Affiliation:
Matsuo Institute, Inc, Tokyo, Japan
Mayuka Kono
Affiliation:
Matsuo Institute, Inc, Tokyo, Japan Division of Information Science, Nara Institute of Science and Technology , Ikoma, Japan
*
Corresponding author: Tomoya Sawada; Email: tsawada01@gmail.com

Abstract

Heating, Ventilation, and Air Conditioning (HVAC) systems are major energy consumers in buildings, challenging the balance between efficiency and occupant comfort. While prior research explored generative AI for HVAC control in simulations, real-world validation remained scarce. This study addresses this gap by designing, deploying, and evaluating “Office-in-the-Loop,” a novel cyber-physical system leveraging generative AI within an operational office setting. Capitalizing on multimodal foundation models and Agentic AI, our system integrates real-time environmental sensor data (temperature, occupancy, etc.), occupants’ subjective thermal comfort feedback, and historical context as input prompts for the generative AI to dynamically predict optimal HVAC temperature setpoints. Extensive real-world experiments demonstrate significant energy savings (up to 47.92%) while simultaneously improving comfort (up to 26.36%) compared to baseline operation. Regression analysis confirmed the robustness of our approach against confounding variables like outdoor conditions and occupancy levels. Furthermore, we introduce Data-Driven Reasoning using Agentic AI, finding that prompting the AI for data-grounded rationales significantly enhances prediction stability and enables the inference of system dynamics and cost functions, bypassing the need for traditional reinforcement learning paradigms. This work bridges simulation and reality, showcasing generative AI’s potential for efficient, comfortable building environments and indicating future scalability to large systems like data centers.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. CG rendering of the office used in this experiment, and the opposing view.

Figure 1

Figure 2. The overhead map that shows the location of office workers and the spatial distribution of office temperature via MELRemo-IPS IoT sensor device. Triangles ($ \varDelta $) indicate the installation locations of the MELRemo-IPS sensors in Figure 3a, while inverted triangles () represent the positions of the Soracom IoT sensors in Figure 3b. Human figures represent the location of office workers. The color distribution visualizes the estimated temperature in degrees Celsius.

Figure 2

Figure 3. In this experiment, (a) was used to track occupant locations and estimate spatial temperature distribution, while (b) was employed for measuring illuminance and obtaining pinpoint temperature.

Figure 3

Figure 4. Conceptual diagram of the proposed system, illustrating the cyber-physical loop: real-world office environment data is collected, used by generative AI in the cloud to predict optimal HVAC temperatures, and then applied to control the physical HVAC system.

Figure 4

Figure 5. Overview of our system for HVAC control using a multimodal foundation model. Leveraging generative AI as a simulator to achieve optimal HVAC forecasting that adapts to dynamically changing real-world office environments.

Figure 5

Figure 6. Our prompt engineering for controlling HVAC.

Figure 6

Table 1. Comparison of energy efficiency and comfort

Figure 7

Figure 7. Box plots of electricity consumption and occupant feedback across experimental conditions.

Figure 8

Figure 8. Office temperature and HVAC setpoint are shown for seven areas, comparing different experimental methodologies (columns) and areas (rows). The x-axis represents time of day, and the y-axis represents temperature. Blue lines indicate room temperature, green lines HVAC setpoint, and red dots HVAC shutdowns, respectively.

Figure 9

Figure 9. Experiments conducted on 29 February in area 5.

Figure 10

Figure 10. Experiments with LLMs and MFMs conducted on 5 February.

Figure 11

Figure 11. Differences in (a) measured electricity consumption and (b) occupant feedback across experimental conditions.

Figure 12

Figure 12. Correlation map between (a) occupant feedback (F) and (b) predicted optimal HVAC settings (P) for each office area (number). Redder hues correspond to positive correlations.

Figure 13

Table 2. Spearman’s correlation coefficients between factors

Figure 14

Table 3. Correlation analysis between factors/conditions and measured energy consumption

Figure 15

Table 4. Results of the regression analysis

Figure 16

Table 5. Variations in daily predictions across different generative AI models on 7 March

Figure 17

Table 6. Results of querying Gemini1.5 (GeminiTeam, 2024a) on March 7 (Turn 4 prompt) for the rationale behind HVAC settings

Figure 18

Figure 13. Box plot of HVAC temperature predictions from 10 inferences using Self-Consistency without Data-Driven Reasoning.

Figure 19

Table 7. Performance comparison of different models for HVAC control tasks

Figure 20

Table 8. Model comparison of estimated energy consumption and reduction from baseline

Figure 21

Table 9. Results of querying GPT-4o (OpenAI, 2024) on 7 March (Turn 5 prompt) for the rationale behind HVAC settings with mathematical reasoning

Figure 22

Table 10. Results of querying OpanAI o1 (OpenAI, 2025) on 7 March (Turn 5 prompt) for the rationale behind HVAC settings with mathematical reasoning

Figure 23

Table 11. Results of querying Gemini2.0 (Pichai et al., 2024) on 7 March (Turn 5 prompt) for the rationale behind HVAC settings with mathematical reasoning

Figure 24

Figure 14. Limitations in (a) the time and (b) space of the feedback.

Figure 25

Figure 15. Example of predicting clothing thermal insulation using GPT-4V (Yang et al., 2023b) from camera images. (a) Input image and prompt. (b) GPT-4V’s generated description of insulation performance.

Supplementary material: File

Sawada et al. supplementary material

Sawada et al. supplementary material
Download Sawada et al. supplementary material(File)
File 10.2 KB
Submit a response

Comments

No Comments have been published for this article.