Hostname: page-component-77f85d65b8-hzqq2 Total loading time: 0 Render date: 2026-04-18T09:53:15.588Z Has data issue: false hasContentIssue false

Uncertainty quantification in tree structure and polynomial regression algorithms toward material indices prediction

Published online by Cambridge University Press:  10 March 2025

Geng-Fu He
Affiliation:
Department of Civil and Environmental Engineering, The Hong Kong, Polytechnic University, Hong Kong, China
Pin Zhang*
Affiliation:
Department of Engineering, University of Cambridge, Cambridge, UK Department of Civil and Environmental Engineering, National University of Singapore, Singapore
Zhen-Yu Yin*
Affiliation:
Department of Civil and Environmental Engineering, The Hong Kong, Polytechnic University, Hong Kong, China
*
Corresponding authors: Pin Zhang and Zhen-Yu Yin; Emails: pinzhang@nus.edu.sg; zhenyu.yin@polyu.edu.hk
Corresponding authors: Pin Zhang and Zhen-Yu Yin; Emails: pinzhang@nus.edu.sg; zhenyu.yin@polyu.edu.hk

Abstract

Machine learning’s integration into reliability analysis holds substantial potential to ensure infrastructure safety. Despite the merits of flexible tree structure and formulable expression, random forest (RF) and evolutionary polynomial regression (EPR) cannot contribute to reliability-based design due to absent uncertainty quantification (UQ), thus hampering broader applications. This study introduces quantile regression and variational inference (VI), tailored to RF and EPR for UQ, respectively, and explores their capability in identifying material indices. Specifically, quantile-based RF (QRF) quantifies uncertainty by weighting the distribution of observations in leaf nodes, while VI-based EPR (VIEPR) works by approximating the parametric posterior distribution of coefficients in polynomials. The compression index of clays is taken as an exemplar to develop models, which are compared in terms of accuracy and reliability, and also with deterministic counterparts. The results indicate that QRF outperforms VIEPR, exhibiting higher accuracy and confidence in UQ. In the regions of sparse data, predicted uncertainty becomes larger as errors increase, demonstrating the validity of UQ. The generalization ability of QRF is further verified on a new creep index database. The proposed uncertainty-incorporated modeling approaches are available under diverse preferences and possess significant prospects in broad scientific computing domains.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Schematic of RF.

Figure 1

Figure 2. Schematic of QRF.

Figure 2

Figure 3. Schematic of VIEPR.

Figure 3

Table 1. Statistics of soil indices

Figure 4

Figure 4. Statistics of soil indices. (a) Scatterplots. (b) Histograms.

Figure 5

Table 2. Hyperparameters of QRF and VIEPR

Figure 6

Table 3. Configuration of QRF and VIEPR based models

Figure 7

Figure 5. Evolution of loss value. (a) QRF. (b) VIEPR (number of transformed variables).

Figure 8

Figure 6. Structure of one tree of QRF.

Figure 9

Figure 7. Prior and posteriors of coefficients in VIEPR.

Figure 10

Figure 8. Performance of models in terms of (a) MAE and (b) R2.

Figure 11

Figure 9. Comparison between measured and predicted $ {C}_c $ with 95% CI generated by (a) QRF and (b) VIEPR.

Figure 12

Figure 10. Relationship of predicted error and uncertainty on e0. (a) Training set. (b) Testing set.

Figure 13

Figure 11. Performance of QRF in predicting $ {C}_{\unicode{x03B1}} $. (a) Accuracy. (b) Reliability.

Figure 14

Table A1. Configuration of PSO

Submit a response

Comments

No Comments have been published for this article.