Hostname: page-component-77f85d65b8-pkds5 Total loading time: 0 Render date: 2026-03-29T09:25:52.924Z Has data issue: false hasContentIssue false

Interpreted machine learning in fluid dynamics: explaining relaminarisation events in wall-bounded shear flows

Published online by Cambridge University Press:  13 May 2022

Martin Lellep*
Affiliation:
SUPA, School of Physics and Astronomy, The University of Edinburgh, James Clerk Maxwell Building, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
Jonathan Prexl
Affiliation:
Department of Civil, Geo and Environmental Engineering, Technical University of Munich, D-80333 Munich, Germany
Bruno Eckhardt
Affiliation:
Physics Department, Philipps-University of Marburg, D-35032 Marburg, Germany
Moritz Linkmann
Affiliation:
School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, Edinburgh EH9 3FD, UK
*
Email address for correspondence: martin.lellep@ed.ac.uk

Abstract

Machine Learning (ML) is becoming increasingly popular in fluid dynamics. Powerful ML algorithms such as neural networks or ensemble methods are notoriously difficult to interpret. Here, we introduce the novel Shapley additive explanations (SHAP) algorithm (Lundberg & Lee, Advances in Neural Information Processing Systems, 2017, pp. 4765–4774), a game-theoretic approach that explains the output of a given ML model in the fluid dynamics context. We give a proof of concept concerning SHAP as an explainable artificial intelligence method providing useful and human-interpretable insight for fluid dynamics. To show that the feature importance ranking provided by SHAP can be interpreted physically, we first consider data from an established low-dimensional model based on the self-sustaining process (SSP) in wall-bounded shear flows, where each data feature has a clear physical and dynamical interpretation in terms of known representative features of the near-wall dynamics, i.e. streamwise vortices, streaks and linear streak instabilities. SHAP determines consistently that only the laminar profile, the streamwise vortex and a specific streak instability play a major role in the prediction. We demonstrate that the method can be applied to larger fluid dynamics datasets by a SHAP evaluation on plane Couette flow in a minimal flow unit focussing on the relevance of streaks and their instabilities for the prediction of relaminarisation events. Here, we find that the prediction is based on proxies for streak modulations corresponding to linear streak instabilities within the SSP. That is, the SHAP analysis suggests that the break-up of the self-sustaining cycle is connected with a suppression of streak instabilities.

Information

Type
JFM Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press.
Figure 0

Figure 1. Time series of the nine spectral coefficients $a_i$ in (2.3), with the laminar coefficient $a_1$ shown in black and modes $a_2$ to $a_9$ are shown in red to yellow. The dashed green line represents the threshold between turbulent and laminar dynamics as defined by an energy threshold on the deviations of the laminar profile $E_l = 5 \times 10^{-3}$, see (4.1). The number of snapshots per training sample is set to $N_s=5$, which are $\Delta t$ apart. The temporal spacing is set to $\Delta t=100$ in this example for visual purposes only, $\Delta t=3$ is used in all calculations with $N_s > 1$. The short orange vertical lines mark prediction time horizons of $t_p=\{200,300,350\}$ for visual guidance, see § 4 for further details.

Figure 1

Figure 2. Normalised training data distributions of modes $1$ to $9$. Here, $t_p=300$ is shifted upwards for visual purposes. Class $1$ ($0$) corresponds to samples that do (not) relaminarise after $t_p$ time steps.

Figure 2

Figure 3. Prediction performance of the trained classifier against temporal prediction horizon $t_p$. The error bar for $t_p=300$ shows $100$ standard deviations $\sigma$ for visualisation purposes and demonstrates the robustness of the results. It has been obtained by training the classifier on training datasets based on different initial random number generator seeds.

Figure 3

Figure 4. Classifier applied in parallel to fluid simulation. (a) Time series with indicated prediction output. The mode corresponding to the laminar profile, $a_1$, is shown in black and modes $a_2$ to $a_9$ are shown in red to yellow. (b) Compared normalised confusion matrices of the model evaluated on the test dataset (top) and during the live prediction (bottom). The normalisation is required to compare both confusion matrices because of the class imbalance between model testing and live prediction. See main text for more details.

Figure 4

Figure 5. Normalised SHAP value distributions of modes $1$ to $9$ for the $10^{5}$ test samples; $t_p=300$ is shifted upwards for visual purposes. Class $1$ ($0$) corresponds to samples that do (not) relaminarise after $t_p$ time steps.

Figure 5

Figure 6. The mean absolute SHAP values as distinguished by the underlying class for $t_p=300$. Class $1$ ($0$) corresponds to samples that do (not) relaminarise after $t_p$ time steps.

Figure 6

Figure 7. Feature importances as measured by mean absolute SHAP values. (a) The feature importances normalised separately for each $t_p$ along its row to show the hierarchy of mode importance. (b) Normalisation constants used in (a). To convert the normalised values shown in (a) to their absolute counterparts, each row would need to be multiplied by the corresponding normalisation shown in (b).

Figure 7

Figure 8. (a,b) Correlation matrices of training data for classes 0 and 1, respectively. Class $1$ ($0$) corresponds to samples that do (not) relaminarise after $t_p$ time steps. (c,d) Mean absolute SHAP interaction values of the first $N_v/10=10\,000$ validation samples for classes 0 and 1, respectively. As self-correlations encoded in the diagonal elements do not convey useful information, the diagonal elements have been set to zero for all panels for presentational purposes.

Figure 8

Figure 9. Two representative velocity-field samples (top) and corresponding SHAP values (bottom) for (a) class $0$ and (b) class $1$. For the velocity field, streak cross-sections, that is the deviation of the streamwise velocity component from the laminar profile evaluated at $x_1 = 0$ is shown. As can be seen by comparison of the top and bottom panels, SHAP uses streak tails, that is regions where the streaks are spatially decaying, for classification towards class $1$ and streak cores, where the velocity is nearly uniform, for classification towards class $0$.

Figure 9

Figure 10. Schematic classification of a spectral velocity field $\boldsymbol {a}$ by a decision tree. The dotted and dashed lines denote positive and negative decisions, respectively. The $J=3$ terminal notes are coloured in grey and the dashed terminal node marks the output of the example classification.

Figure 10

Table 1. Table of optimal hyperparameters for XGBoost classifier for the task of predicting the relaminarisation of the turbulent trajectory. The abbreviations in the header line have been introduced in the main text of Appendix A.1.