Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-13T11:41:53.427Z Has data issue: false hasContentIssue false

Observable-augmented manifold learning for multi-source turbulent flow data

Published online by Cambridge University Press:  09 May 2025

Kai Fukami*
Affiliation:
Department of Aerospace Engineering, Tohoku University, Sendai 980-8579, Japan
Kunihiko Taira
Affiliation:
Department of Mechanical and Aerospace Engineering, University of California, Los Angeles, CA 90095, USA
*
Corresponding author: Kai Fukami, kfukami1@tohoku.ac.jp

Abstract

This study seeks a low-rank representation of turbulent flow data obtained from multiple sources. To uncover such a representation, we consider finding a finite-dimensional manifold that captures underlying turbulent flow structures and characteristics. While nonlinear machine-learning techniques can be considered to seek a low-order manifold from flow field data, there exists an infinite number of transformations between data-driven low-order representations, causing difficulty in understanding turbulent flows on a manifold. Finding a manifold that captures turbulence characteristics becomes further challenging when considering multi-source data together due to the presence of inherent noise or uncertainties and the difference in the spatiotemporal length scale resolved in flow snapshots, which depends on approaches in collecting data. With an example of numerical and experimental data sets of transitional turbulent boundary layers, this study considers an observable-augmented nonlinear autoencoder-based compression, enabling data-driven feature extraction with prior knowledge of turbulence. We show that it is possible to find a low-rank subspace that not only captures structural features of flows across the Reynolds number but also distinguishes the data source. Along with machine-learning-based super-resolution, we further argue that the present manifold can be used to validate the outcome of modern data-driven techniques when training and evaluating across data sets collected through different techniques. The current approach could serve as a foundation for a range of analyses including reduced-complexity modelling and state estimation with multi-source turbulent flow data.

Information

Type
JFM Rapids
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Transitional turbulent boundarylayer data sets. $(a)$ Direct numerical simulation: part of the $x{-}y$ sectional domain and subdomains $(\textrm {i}-\textrm {iii})$ are shown. $(b)$ Particle image velocimetry: three different Reynolds numbers are considered.

Figure 1

Figure 2. Observable-augmented autoencoder composed of convolutional neural network (CNN) and multi-layer perceptron (MLP) (Fukami & Taira 2023).

Figure 2

Figure 3. Representative reconstructed fields through nonlinear autoencoder compression across the latent dimension $n_{\boldsymbol \xi }$. The DNS ($Re_\tau = 897$) and PIV ($Re_\tau = 605$) fields are shown. The $L_2$ reconstruction error $\varepsilon$ is reported underneath each field.

Figure 3

Figure 4. Cross-source manifold identified through the observable-augmented nonlinear autoencoder (AE). The coefficient space composed of the three dominant POD modes is shown. The $L_2$ error $\varepsilon$ averaged over the samples is reported underneath each coefficient space.

Figure 4

Figure 5. Machine-learning-based super-resolution reconstruction for DNS data of turbulent boundary layers. The $L_2$ reconstruction error $\varepsilon$ is reported underneath each field.

Figure 5

Figure 6. Application of machine-learning-based super-resolution model ${\mathcal F}_{\textit{SR}}$ trained with the DNS data to the PIV data sets. The reconstructed fields, the $L_1$ difference field between the reconstructed and original PIV $|\varepsilon _{L_1}| = |{\boldsymbol q}_{\textit{PIV}}-{\mathcal F}_{\textit{SR}}({{\boldsymbol q}_{\textit{LR,PIV}}})|$, and the root mean square of streamwise velocity fluctuation $u_{\textit{rms}}$ across the wall-normal direction are shown.

Figure 6

Figure 7. Assessment of super-resolved PIV fields on the identified coordinate. The coefficient space composed of the three dominant POD modes is shown.