Hostname: page-component-77f85d65b8-g4pgd Total loading time: 0 Render date: 2026-04-22T18:31:04.924Z Has data issue: false hasContentIssue false

Data-driven surrogate modeling and benchmarking for process equipment

Published online by Cambridge University Press:  04 September 2020

Gabriel F. N. Gonçalves
Affiliation:
Department of Chemical Engineering, Imperial College London, London, United Kingdom
Assen Batchvarov
Affiliation:
Department of Chemical Engineering, Imperial College London, London, United Kingdom
Yuyi Liu
Affiliation:
Department of Chemical Engineering, Imperial College London, London, United Kingdom
Yuxin Liu
Affiliation:
Department of Chemical Engineering, Imperial College London, London, United Kingdom
Lachlan R. Mason
Affiliation:
Data Centric Engineering Program, The Alan Turing Institute, London, United Kingdom
Indranil Pan
Affiliation:
Data Centric Engineering Program, The Alan Turing Institute, London, United Kingdom Centre for Environmental Policy, Imperial College London, London, United Kingdom
Omar K. Matar*
Affiliation:
Department of Chemical Engineering, Imperial College London, London, United Kingdom
*
*Corresponding author. E-mail: o.matar@imperial.ac.uk

Abstract

In chemical process engineering, surrogate models of complex systems are often necessary for tasks of domain exploration, sensitivity analysis of the design parameters, and optimization. A suite of computational fluid dynamics (CFD) simulations geared toward chemical process equipment modeling has been developed and validated with experimental results from the literature. Various regression-based active learning strategies are explored with these CFD simulators in-the-loop under the constraints of a limited function evaluation budget. Specifically, five different sampling strategies and five regression techniques are compared, considering a set of four test cases of industrial significance and varying complexity. Gaussian process regression was observed to have a consistently good performance for these applications. The present quantitative study outlines the pros and cons of the different available techniques and highlights the best practices for their adoption. The test cases and tools are available with an open-source license to ensure reproducibility and engage the wider research community in contributing to both the CFD models and developing and benchmarking new improved algorithms tailored to this field.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2020. Published by Cambridge University Press in association with Data-Centric Engineering
Figure 0

Figure 1. Workflow utilized for building the surrogate models for computational fluid dynamics (CFD) simulations through the active learning methodology.

Figure 1

Table 1. Number of fitting parameters used in each regression technique.

Figure 2

Figure 2. Schematic representation of the static mixer geometry used in Case 1 with side- and front-views shown in (a) and (b), respectively.

Figure 3

Figure 3. Computational fluid dynamics simulation of turbulent scalar transport through a static mixer using the $ k $-$ \varepsilon $ model (Launder and Spalding, 1974) for Case 1 showing steady, two-dimensional nondimensional velocity magnitude (a), turbulent kinetic energy (b), and scalar concentration (c), fields, generated with $ \mathit{\operatorname{Re}}={10}^5 $, $ L/D=0.3 $, and $ \theta =\pi /4 $. The scale bars represent the magnitude of the fields depicted in each panel.

Figure 4

Figure 4. Schematic representation of the orifice geometry used in Case 2 with side- and front-views shown in (a) and (b), respectively.

Figure 5

Figure 5. A comparison of the current predictions of pressure drop across the orifice as a function of the local Reynolds number with the experimental data of Fossa and Guglielmini (2002) with $ {\left(d/D\right)}^2=0.54 $ and $ s/d=0.20 $ (see Figure 4b).

Figure 6

Figure 6. Schematic representation of the geometry used in Case 3, two-dimensional flow in an inline mixer.

Figure 7

Table 2. Comparison between present results for power coefficient, $ \alpha $, and those reported by Vial et al. (2015).

Figure 8

Figure 7. Schematic representation of the geometry used in Case 4, three-dimensional flow in an inline mixer.

Figure 9

Table 3. Features and responses utilized in each of the three test cases.

Figure 10

Table 4. Typical computational cost of individual simulations of each test case.

Figure 11

Figure 8. Comparison between regressions for case 1, with 20 samples. Circles indicate sampling locations.

Figure 12

Figure 9. Comparison between sampling strategies for Case 1, with 20 samples. Circles indicate sampling locations and numbers indicate the sampling order (positions used for initialization are unlabeled).

Figure 13

Figure 10. Error as a function of number of samples beyond the ones used for the initialization for different regression strategies, for case 1 (top), case 2 (middle) and case 3 (bottom).

Figure 14

Figure 11. Error as a function of number of samples beyond the ones used for the initialization for different sampling strategies, for Case 1 (top), Case 2 (middle), and Case 3 (bottom).

Figure 15

Figure 12. Error of the regressions developed for Case 4 as a function of number of samples beyond the ones used for the initialization, for different regression strategies (top), and different sampling strategies (bottom).

Figure 16

Figure 13. Comparison between predicted and reference values for the GP52 regression with GIO (a) and variational sampling (b) generated with 20 samples for Case 4.

Figure 17

Figure 14. Comparison between predicted and reference values for each Cases 1–3, with 5 (left) or 20 samples (right). Dashed lines represent a 20% error.

Figure 18

Table 5. Error for different sampling strategies, for 5, 10 or 20 queries.

Figure 19

Figure A1. Effect of mesh resolution on concentration profile along the symmetry axis.

Figure 20

Figure A2. Effect of mesh resolution on pressure profile along the symmetry axis.

Submit a response

Comments

No Comments have been published for this article.