Hostname: page-component-5db58dd55d-bthnr Total loading time: 0 Render date: 2026-06-03T06:59:20.052Z Has data issue: false hasContentIssue false

Bayesian parameter inference for shallow subsurface modeling using field data and impacts on geothermal planning

Published online by Cambridge University Press:  02 November 2022

Monika J. Kreitmair*
Affiliation:
Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
Nikolas Makasis
Affiliation:
Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
Kathrin Menberg
Affiliation:
Institute of Applied Geosciences, Karlsruhe Institute of Technology, 76131 Karlsruhe, Germany
Asal Bidarmaghz
Affiliation:
School of Civil and Environmental Engineering, University of New South Wales, Sydney, Australia
Gareth J. Farr
Affiliation:
British Geological Survey, Cardiff University, Cardiff CF10 3AT, United Kingdom The Coal Authority, Mansfield, Nottinghamshire NG18 4RG, United Kingdom
David P. Boon
Affiliation:
British Geological Survey, Cardiff University, Cardiff CF10 3AT, United Kingdom
Ruchi Choudhary
Affiliation:
Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, United Kingdom Data-centric Engineering, Alan Turing Institute, British Library, London NW1 2DB, United Kingdom
*
*Corresponding author. E-mail: mk2040@cam.ac.uk

Abstract

Understanding the subsurface is crucial in building a sustainable future, particularly for urban centers. Importantly, the thermal effects that anthropogenic infrastructure, such as buildings, tunnels, and ground heat exchangers, can have on this shared resource need to be well understood to avoid issues, such as overheating the ground, and to identify opportunities, such as extracting and utilizing excess heat. However, obtaining data for the subsurface can be costly, typically requiring the drilling of boreholes. Bayesian statistical methodologies can be used towards overcoming this, by inferring information about the ground by combining field data and numerical modeling, while quantifying associated uncertainties. This work utilizes data obtained in the city of Cardiff, UK, to evaluate the applicability of a Bayesian calibration (using GP surrogates) approach to measured data and associated challenges (previously not tested) and to obtain insights on the subsurface of the area. The importance of the data set size is analyzed, showing that more data are required in realistic (field data), compared to controlled conditions (numerically-generated data), highlighting the importance of identifying data points that contain the most information. Heterogeneity of the ground (i.e., input parameters), which can be particularly prominent in large-scale subsurface domains, is also investigated, showing that the calibration methodology can still yield reasonably accurate results under heterogeneous conditions. Finally, the impact of considering uncertainty in subsurface properties is demonstrated in an existing shallow geothermal system in the area, showing a higher than utilized ground capacity, and the potential for a larger scale system given sufficient demand.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Outlines of buildings with basements (a), geological distribution (b), and hydraulic head distribution (c) within the study domain. Groundwater levels are pumped within dashed regions (copyright BGS, UKRI). © Crown copyright and database rights 2021 Ordnance Survey [100021290 EUL]. Use of this data is subject to terms and conditions.

Figure 1

Table 1. Thermal and hydraulic properties of geological materials present in the modeled domain and concrete material used for heat sources (SimonHydrotechnica, 1993; Howard et al., 2008; Dalla Santa et al., 2020; Hobbs et al., 2002; Parkes et al., 2020; Boon et al., 2021). Thermal diffusivity was calculated according to . Where appropriate, top values in a row represent partially saturated conditions and bottom values fully saturated conditions.

Figure 2

Figure 2. Measurement locations in domain with color denoting depth of sensor (left panel) and examples of data fitting (right two panels). © Crown copyright and database rights 2021 Ordnance Survey [100021290 EUL]. Use of this data is subject to terms and conditions.

Figure 3

Table 2. Ranking of uncertain parameters considered in the model according to mean temperature sensitivity.

Figure 4

Figure 3. Schematic of the three models with different levels of fidelity (number of modeling planes). The 2D planes are connected to their nearest neighbor by convective heat flux transfer and temperature boundary conditions are applied at the top- and bottom-most planes.

Figure 5

Figure 4. Results of the Morris method sensitivity analysis of parameter impact on mean temperature, shown as box plots of the effective mean across the 24 measurement locations and indicating the average value as a black cross. The higher the effective mean, the more sensitive the model output is to changes in this parameter, giving it a higher ranking.

Figure 6

Figure 5. Mean temperature data determined from fitting the temperature output from the data-generating model (black crosses) and the field data temperature time-series (black asterisks) across the 24 measurement locations. Also shown is the output from the 20 LHS samples run using the high-fidelity model (colored pluses) and the 50 LHS samples using the low-fidelity model. The numerical model runs capture the extent of the measured and synthetic data well.

Figure 7

Figure 6. Parameter posterior and prior distributions for parameter values from calibration using numerically-generated (top row) and field (bottom row) temperature data sets. The posteriors for the synthetic data set show good narrowing around the input value (indicated by the vertical dashed line) and the posteriors inferred from the field measurements exhibit a reduction in uncertainty.

Figure 8

Figure 7. Dendrogram indicating spatial proximity of measurement points. Borehole indices colored blue indicate points located within the aquifer, and orange colored indicate points outwith the aquifer. The indices of the points removed upon each reduction in subset side, in order are 10, 14, 15, 18, 24, 9, 22, 7, 16, 11, 19, 2, 8, 23, 6, 5, 4, 17, 13, 3.

Figure 9

Figure 8. Box plots showing the progression of parameter calibration with increasing subset size for the three calibration parameters (left to right) for the single-valued synthetic data (top row) and the field data collected from boreholes (bottom row). The mean distribution values are marked with “x” for each case. The posteriors exhibit a narrowing around the input value with increasing data set size for the synthetic data and more clearly defined distribution behavior for the field-data.

Figure 10

Figure 9. Posteriors inferred from 20 unique field-data subsets consisting of 12 measurement locations. The variation in posterior distribution indicates that the different data points provide different information to the calibration.

Figure 11

Figure 10. Prior and posterior distributions for calibration performed on the “noisy” hydraulic conductivity output data set. The single-valued input parameters are well inferred and the posterior for the hydraulic conductivity approaches the input distribution.

Figure 12

Figure 11. Posteriors for regionally varying hydraulic conductivity synthetic data set, shown for calibration using regional data subsets. The dashed lines in the left and right-most panels indicate the “true” parameter value, used as an input in the data-generating model, and the black crosses in the central panel indicate the hydraulic conductivity values used in each of the four regions of the domain. The posteriors for the hydraulic conductivity approximate well the regional input values.

Figure 13

Figure 12. Comparison between mean temperatures outputted at the borehole locations for the regional ($ x $-axis) and the calibrated ($ y $-axis) data. The results obtained from the calibrated model are in good agreement (i.e., within 0.2 °C) of the numerically generated “field” measurements.

Figure 14

Table 3. Hydraulic conductivity ($ {k}_h $) values used in the analysis.

Figure 15

Figure 13. Comparison of geothermal energy potential for different values of $ {k}_h $. Results are shown in terms of the peak load for the current well spacing (left axis) and hypothetical maximum when the well spacing is such that no thermal interference happens between the wells (right axis).

Figure 16

Figure 14. Distance between abstraction and injection wells in order to avoid thermal interference in the open-loop system, for the different values of $ {k}_h $ used in this analysis (Banks, 2009).

Figure 17

Table 4. Uncertain hyper-parameters in the multi-fidelity framework and prior probability distributions.

Figure 18

Figure 15. Schematic of the modelling approach, showing the collection of 2D planes interconnected by heat flux transfer and the temperature boundary conditions at the top and bottom planes.

Figure 19

Figure 16. Trace plots for the calibrations using both the synthetically generated (top row) and ‘real’ field data (bottom row).

Figure 20

Table 5. Subset information for small-scale study

Figure 21

Figure 17. Hyper-parameter posteriors determined from calibrating on field-data using the multi-fidelity framework.

Submit a response

Comments

No Comments have been published for this article.