Hostname: page-component-6766d58669-fx4k7 Total loading time: 0 Render date: 2026-05-21T07:01:31.015Z Has data issue: false hasContentIssue false

Realisability-informed machine learning for turbulence anisotropy mappings

Published online by Cambridge University Press:  22 September 2025

Ryley McConkey*
Affiliation:
Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139-4307, USA University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
Nikhila Kalia
Affiliation:
University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
Eugene Yee
Affiliation:
University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
Fue-Sang Lien
Affiliation:
University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
*
Corresponding author: Ryley McConkey, rmcconke@mit.edu

Abstract

Within the context of machine learning-based closure mappings for Reynolds-averaged Navier Stokes turbulence modelling, physical realisability is often enforced using ad hoc postprocessing of the predicted anisotropy tensor. In this study, we address the realisability issue via a new physics-based loss function that penalises non-realisable results during training, thereby embedding a preference for realisable predictions into the model. Additionally, we propose a new framework for data-driven turbulence modelling which retains the stability and conditioning of optimal eddy viscosity-based approaches while embedding equivariance. Several modifications to the tensor basis neural network to enhance training and testing stability are proposed. We demonstrate the conditioning, stability and generalisation of the new framework and model architecture on three flows: flow over a flat plate, flow over periodic hills and flow through a square duct. The realisability-informed loss function is demonstrated to significantly increase the number of realisable predictions made by the model when generalising to a new flow configuration. Altogether, the proposed framework enables the training of stable and equivariant anisotropy mappings, with more physically realisable predictions on new data. We make our code available for use and modification by others. Moreover, as part of this study, we explore the applicability of Kolmogorov–Arnold networks to turbulence modelling, assessing its potential to address nonlinear mappings in the anisotropy tensor predictions and demonstrating promising results for the flat plate case.

Information

Type
JFM Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Conditioning test comparing the errors in $U_1$ after injecting: (a) the DNS anisotropy tensor fully explicitly, (b) the optimal eddy viscosity (implicitly) and the remaining nonlinear part of the anisotropy tensor calculated using $S^\theta _{\textit{ij}}$ (explicitly), (c) the eddy viscosity estimated by the $k$-$\omega$ shear stress transport (SST) model (implicitly) and the remaining nonlinear part of the anisotropy tensor calculated using $S^\theta _{\textit{ij}}$ (explicitly) and (d) the eddy viscosity estimated by the $k$-$\omega$ SST model (implicitly), and the remaining nonlinear part of the anisotropy tensor calculated using $S^{{R}}_{\textit{ij}}$ (explicitly), $k$ is the turbulent kinetic energy (TKE), $\omega$ is the TKE specific dissipation rate and RSME is the Root mean squared error.

Figure 1

Figure 2. Conditioning test comparing the errors in $U_2$ after injecting: (a) the DNS anisotropy tensor fully explicitly, (b) the optimal eddy viscosity (implicitly) and the remaining nonlinear part of the anisotropy tensor calculated using $S^\theta _{\textit{ij}}$ (explicitly), (c) the eddy viscosity estimated by the $k$-$\omega$ SST model (implicitly) and the remaining nonlinear part of the anisotropy tensor calculated using $S^\theta _{\textit{ij}}$ (explicitly) and (d) the eddy viscosity estimated by the $k$-$\omega$ SST model (implicitly), and the remaining nonlinear part of the anisotropy tensor calculated using $S^{{R}}_{\textit{ij}}$ (explicitly).

Figure 2

Figure 3. Realisability penalty for the (a) diagonal components of $b_{\textit{ij}}$, and (b) off-diagonal components of $b_{\textit{ij}}$.

Figure 3

Figure 4. Model architecture and training configuration for the original TBNN proposed by Ling et al. (2016).

Figure 4

Figure 5. Model architecture and training configuration proposed in the present investigation. Note that during training, $S^\theta _{\textit{ij}}$ is used in $\hat {T}^{(1)}_{\textit{ij}}$, whereas all other quantities come from the original RANS simulation.

Figure 5

Figure 6. Test-time injection configuration proposed in the present investigation. The hatching over $g_1$ and $\hat {T}^{(1)}_{\textit{ij}}$ indicates that they are not used at injection time.

Figure 6

Figure 7. Model architecture and training configuration for TBKAN which replaces the multi-layer perceptron in the modified TBNN (see figure 5) with a KAN.

Figure 7

Table 1. Datasets used for training, validation and testing. Varied parameters are defined in § 2.1.

Figure 8

Table 2. Hyperparameters selected for each model.

Figure 9

Figure 8. Loss function vs epoch for the (a) TBNN and (b) KCNN models on the flat plate dataset; the (c) TBNN and (d) KCNN models on the square duct dataset; and the (e) TBNN and (f) KCNN models on the periodic hill dataset.

Figure 10

Figure 9. Loss function vs epoch for the TBKAN model trained on the flat plate dataset, with the best-performing KAN configuration given by a network width of $w=9$, a grid size of $g=8$ and a polynomial order of $k=3$ (cubic B-spline).

Figure 11

Figure 10. Computational domain for the zero pressure gradient flat plate boundary layer case.

Figure 12

Figure 11. Streamwise velocity profile in the ${\textit{Re}}_\theta =3630$ boundary layer predicted by the $k$-$\omega$ SST model, compared with the reference data from Schlatter & Örlü (2010). Here, $U^+_1 = U^{}_1/u_\tau$, where $u_\tau =\sqrt {\tau _w/\rho }$ is the friction velocity, and $\tau _w$ is the wall shear stress. A density of $1$ kg m$^3$ was used to be consistent with OpenFOAM’s kinematic units. The wall-normal coordinate is $y^+=x_2u_\tau /\nu$.

Figure 13

Figure 12. Momentum thickness Reynolds number growth along the flat plate as predicted by the $k$-$\omega$ SST model, and the reference DNS data from Schlatter & Örlü (2010).

Figure 14

Figure 13. Evolution of (a) $a_{11}$, (b) $a_{12}$, (c) $a_{22}$ and (d) $a_{33}$ in the ${\textit{Re}}_\theta = 3630$ boundary layer, as predicted by the DNS data from Schlatter & Örlü (2010), the TBNN/KCNN model and the baseline $k$-$\omega$ SST model.

Figure 15

Figure 14. Evolution of (a) $a_{11}$, (b) $a_{12}$, (c) $a_{22}$ and (d) $a_{33}$ in the ${\textit{Re}}_\theta = 3630$ boundary layer, as predicted by the DNS data from Schlatter & Örlü (2010), the TBKAN/KCNN model and the baseline $k$-$\omega$ SST model.

Figure 16

Figure 15. Computational (a) domain and (b) RANS mesh for the square duct flow.

Figure 17

Figure 16. Components of $a_{\textit{ij}}$ predicted by the DNS data from Pinelli et al. (2010), the $k$-$\omega$ SST model, and the TBNN/KCNN model from the present investigation. Shown here are contours of $a_{\textit{ij}}$ components in the lower left $x_2\leqslant 0, \ x_3\leqslant 0$ quadrant.

Figure 18

Figure 17. Contours of turbulent kinetic energy $k$ predicted by (a) the $k$-$\omega$ SST model, (b) the KCNN in the present investigation and (c) the DNS data from Pinelli et al. (2010). Shown here is the lower left $x_2\leqslant 0, \ x_3\leqslant 0$ quadrant.

Figure 19

Figure 18. Velocity vectors and in-plane kinetic energy predicted by (a) the DNS data from Pinelli et al. (2010), (b) the $k$-$\omega$ SST model and (c) injecting the TBNN/KCNN predictions into the RANS momentum equations. Shown here is the lower left $x_2\leqslant 0, \ x_3\leqslant 0$ quadrant.

Figure 20

Figure 19. Profiles of the in-plane velocity components (a) $U_2$ and (b) $U_3$, as predicted by the $k$-$\omega$ SST model, the injected TBNN/KCNN predictions and the DNS data from Pinelli et al. (2010). Shown here is the lower left $x_2\leqslant 0, \ x_3\leqslant 0$ quadrant.

Figure 21

Figure 20. Velocity vectors and in-plane kinetic energy predicted by (a) the DNS data from Pinelli et al. (2010), (b) the $k$-$\omega$ SST model and (c) injecting the TBNN/KCNN predictions into the RANS momentum equations. Shown here is the lower left $x_2\leqslant 0, \ x_3\leqslant 0$ quadrant.

Figure 22

Figure 21. Computational domain and mesh for the periodic hill case, with $\alpha =1.2$.

Figure 23

Figure 22. Contours of non-zero $a_{\textit{ij}}$ components predicted by the DNS data from Xiao et al. (2020), the $k$-$\omega$ SST model and the TBNN/KCNN model predictions from the present investigation.

Figure 24

Figure 23. Contours of $U_1$ and $U_1$ predicted by the DNS data from Xiao et al. (2020), the $k$-$\omega$ SST model and the injected TBNN/KCNN model predictions from the present investigation.

Figure 25

Figure 24. Contours of error in $U_1$ and $U_2$ predicted by the $k$-$\omega$ SST model and the injected TBNN/KCNN model predictions, as compared with the DNS data from Xiao et al. (2020).

Figure 26

Table 3. Comparison of mean squared error and number of non-realisable predictions when training with and without a realisability-informed loss function.

Figure 27

Figure 25. Projection of the predicted $b_{\textit{ij}}$ from the DNS reference data (a,e,i), the $k$-$\omega$ SST model (b, f, j) and the TBNN/KCNN with (d,h,l) and without (c,g,k) realisability-informed learning onto the barycentric triangle. Panels (a)–(d) show the flat plate data, (e)–(h) show the square duct data and (i)–(l) show the periodic hill data. Points outside the triangle are not realisable.

Figure 28

Table 4. Non-zero invariants of the minimal integrity basis tensor formed by $\hat {S}_{\textit{ij}}$, $\hat {R}_{\textit{ij}}$, $\hat {C}_{\textit{ij}}$ and $\hat {D}_{\textit{ij}}$.