Hostname: page-component-5db58dd55d-smskv Total loading time: 0 Render date: 2026-05-30T14:36:16.878Z Has data issue: false hasContentIssue false

Explainable deep convolutional learning for intuitive model development by non–machine learning domain experts

Published online by Cambridge University Press:  05 October 2020

Sundaravelpandian Singaravel*
Affiliation:
Architectural Engineering Division, KU Leuven, Leuven, Belgium
Johan Suykens
Affiliation:
ESAT-STADIUS, KU Leuven, Leuven, Belgium
Hans Janssen
Affiliation:
Department of Civil Engineering, Building Physics Section, KU Leuven, Leuven, Belgium
Philipp Geyer
Affiliation:
Architectural Engineering Division, KU Leuven, Leuven, Belgium
*
Corresponding author Sundaravelpandian Singaravel sundaravelpandian@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

During the design stage, quick and accurate predictions are required for effective design decisions. Model developers prefer simple interpretable models for high computation speed. Given that deep learning (DL) has high computational speed and accuracy, it will be beneficial if these models are explainable. Furthermore, current DL development tools simplify the model development process. The article proposes a method to make the learning of the DL model explainable to enable non–machine learning (ML) experts to infer on model generalization and reusability. The proposed method utilizes dimensionality reduction (t-Distribution Stochastic Neighbour Embedding) and mutual information (MI). Results indicate that the convolutional layers capture design-related interpretations, and the fully connected layer captures performance-related interpretations. Furthermore, the global geometric structure within a model that generalized well and poorly is similar. The key difference indicating poor generalization is smoothness in the low-dimensional embedding. MI enables quantifying the reason for good and poor generalization. Such interpretation adds more information on model behaviour to a non-ML expert.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
© The Author(s), 2020. Published by Cambridge University Press
Figure 0

Figure 1. Architecture of the CNN for design energy performance prediction.

Figure 1

Table 1. Building design parameters and sampling range.

Figure 2

Figure 2. Illustration of t-SNE – Conversion of two-dimensional data (left) into one-dimensional embedding (right).

Figure 3

Figure 3. Mean-squared-error (MSE) and mutual information (MI) for different model complexity.

Figure 4

Figure 4. Training and test design cases.

Figure 5

Table 2. Architecture of the deep learning model.

Figure 6

Figure 5. Cooling model test mean absolute percentage error (MAPE) in 100 training runs. The models highlighted in green, generalized well and poorly. The highlighted models are used in Section 5.2 for analysis.

Figure 7

Figure 6. Heating model test mean absolute percentage error (MAPE) in 100 training runs. The models highlighted in green, generalized well and poorly. The highlighted models are used in Section 5.2 for analysis.

Figure 8

Figure 7. Cooling model: Low-dimensional embedding overlaid with information about building design with different types of chiller. The black star and dot correspond to designs with low energy demand. Similarly, the red star and dot correspond to design options with high energy demand. Table 3 shows the engineering description of the highlighted design options.

Figure 9

Figure 8. Heating model: Low-dimensional embedding overlaid with information about building design with different types of the boiler pump. The black star and dot correspond to designs with low energy demand. Similarly, the red star and dot correspond to design options with high energy demand.

Figure 10

Table 3. Design options highlighted in Figure 7.

Figure 11

Figure 9. Cooling model: Low embedding’s dependency plot overlaid with information of chiller type (top) model with good generalization (bottom) model with poor generalization.

Figure 12

Figure 10. Heating model: Low embedding’s dependency plot overlaid with information of the boiler pump type (top) model with good generalization (bottom) model with poor generalization.

Figure 13

Figure 11. Test mean absolute error percentage (MAPE) versus mutual information (top) for the cooling model, (bottom) for the heating model. The cross in the figures highlights the MAPE and mutual information (MI) for a model with good generalization. It can be noted from the cross that a model with good generalization has neither low nor high MI.