Hostname: page-component-77f85d65b8-g4pgd Total loading time: 0 Render date: 2026-03-28T23:09:44.475Z Has data issue: false hasContentIssue false

Predicting concrete strength using machine learning and fresh-state measurements

Published online by Cambridge University Press:  13 August 2025

Bahdan Zviazhynski*
Affiliation:
Cavendish Laboratory, University of Cambridge , Cambridge, UK
Callum White
Affiliation:
Department of Engineering, University of Cambridge , Cambridge, UK
Janet M. Lees
Affiliation:
Department of Engineering, University of Cambridge , Cambridge, UK
Gareth J. Conduit
Affiliation:
Cavendish Laboratory, University of Cambridge , Cambridge, UK Intellegens, Cambridge, UK
*
Corresponding author: Bahdan Zviazhynski; Email: bogdanzwier@gmail.com

Abstract

Understanding the properties of lower-carbon concrete products is essential for their effective utilization. Insufficient empirical test data hinders practical adoption of these emerging products, and a lack of training data limits the effectiveness of current machine learning approaches for property prediction. This work employs a random forest machine learning model combined with a just-in-time approach, utilizing newly available data throughout the concrete lifecycle to enhance predictions of 28 and 56 day concrete strength. The machine learning hyperparameters and inputs are optimized through a novel unified metric that combines prediction accuracy and uncertainty estimates through the coefficient of determination and the distribution of uncertainty quality. This study concludes that optimizing solely for accuracy selects a different model than optimizing with the proposed unified accuracy and uncertainty metric. Experimental validation compares the 56-day strength of two previously unseen concrete mixes to the machine learning predictions. Even with the sparse dataset, predictions of 56-day strength for the two mixes were experimentally validated to within 90% confidence interval when using slump as an input and further improved by using 28-day strength.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Prevalence of model input variables used to predict compressive strength in the literature (Chou et al., 2014; Deng et al., 2018; Young et al., 2019; Feng et al., 2020; Hadzima-Nyarko et al., 2020; Pham et al., 2020; Khursheed et al., 2021; Nunez et al., 2021; Salami et al., 2021; Wan et al., 2021; Nguyen et al., 2021a; Han et al., 2022; Khan et al., 2022; Liu, 2022; Mansouri et al., 2022; Chi et al., 2023; Lee et al., 2023; Li et al., 2023; Pakzad et al., 2023; Hariri-Ardebili et al., 2024).

Figure 1

Figure 2. Life cycle of concrete, from mixing to the hardened state, with the step-wise prediction approach and decision tree highlighted at each concrete age.

Figure 2

Figure 3. (a) Correlation map for the OPC mixes within the dataset, (b) correlation map for the GGBS mixes within the dataset, and (c) correlation map for the unified dataset. The right-hand scale bar calibrates the correlation values.

Figure 3

Figure 4. (a) The two-layer random forest model. (b) A graphical description of a tree with $ \mathit{\min}\_ samples\_ leaf=4 $. Green boxes are tree leaves, black points are training data, the pink stepped line is the prediction of a single tree with all data present, and the pink shaded area is the range of the predictions from the ensemble of trees.

Figure 4

Figure 5. Binned distribution of $ \varepsilon $ (magenta rectangles) and standard normal distribution (white transparent rectangles).

Figure 5

Figure 6. (a) Schematic of leave-one-out cross-validation. Gray squares are entries in the existing data and magenta squares are the test entries for each fold. (b) Model performance for different values of $ \mathit{\min}\_ samples\_ leaf $.

Figure 6

Table 1. Model performance for different inputs and outputs

Figure 7

Figure 7. The predicted versus experimental 56 day strength with model uncertainty (blue points). The magenta line shows the ideal trend.

Figure 8

Table 2. Comparison of Mix A, Mix B, and training data parameters

Figure 9

Table 3. Mix composition and fresh and hardened testing results for the experimental series

Figure 10

Figure 8. Predictions of 28-day strength for (a) Mix A using the model without slump (blue) and model with slump (orange) and (b) Mix B using the model without slump (blue) and model with slump (orange). Small error bars represent standard deviation of predictions, large error bars correspond to 90% confidence intervals of predictions. Dashed vertical line is the experimental value, and gray shaded area is 90% confidence interval of the experimental value.

Figure 11

Figure 9. Predictions of 56 day strength for (a) Mix A using the model without slump that optimizes $ \alpha $ (blue), model without slump that optimizes $ {R}^2 $ (purple), model with slump that optimizes $ \alpha $ (orange), and model with slump and 28 day strength that optimizes $ \alpha $ (green); (b) Mix B using the model without slump that optimizes $ \alpha $ (blue), model without slump that optimizes $ {R}^2 $ (purple), model with slump that optimizes $ \alpha $ (orange), and model with slump and 28 day strength that optimizes $ \alpha $ (green). Small error bars represent standard deviation of predictions, large error bars correspond to 90% confidence intervals of predictions. Dashed vertical line is the experimental value, and gray shaded area is 90% confidence interval of the experimental value.

Supplementary material: File

Zviazhynski et al. supplementary material

Zviazhynski et al. supplementary material
Download Zviazhynski et al. supplementary material(File)
File 56 KB
Submit a response

Comments

No Comments have been published for this article.