Hostname: page-component-6766d58669-nf276 Total loading time: 0 Render date: 2026-05-15T08:50:01.092Z Has data issue: false hasContentIssue false

Physics-constrained deep reinforcement learning for flow field denoising

Published online by Cambridge University Press:  13 October 2023

Mustafa Z. Yousif
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea LSTME-Busan Branch, German Engineering Research and Development Center, 1276 Jisa-dong, Gangseo-gu, Busan, 46742, Republic of Korea
Meng Zhang
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Linqi Yu
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Yifan Yang
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Haifeng Zhou
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
HeeChang Lim*
Affiliation:
School of Mechanical Engineering, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
*
Email address for correspondence: hclim@pusan.ac.kr

Abstract

A multi-agent deep reinforcement learning (DRL)-based model is presented in this study to reconstruct flow fields from noisy data. A combination of reinforcement learning with pixel-wise rewards, physical constraints represented by the momentum equation and the pressure Poisson equation, and the known boundary conditions is used to build a physics-constrained deep reinforcement learning (PCDRL) model that can be trained without the target training data. In the PCDRL model, each agent corresponds to a point in the flow field and learns an optimal strategy for choosing pre-defined actions. The proposed model is efficient considering the visualisation of the action map and the interpretation of the model operation. The performance of the model is tested by using direct numerical simulation-based synthetic noisy data and experimental data obtained by particle image velocimetry. Qualitative and quantitative results show that the model can reconstruct the flow fields and reproduce the statistics and the spectral content with commendable accuracy. Furthermore, the dominant coherent structures of the flow fields can be recovered by the flow fields obtained from the model when they are analysed using proper orthogonal decomposition and dynamic mode decomposition. This study demonstrates that the combination of DRL-based models and the known physics of the flow fields can potentially help solve complex flow reconstruction problems, which can result in a remarkable reduction in the experimental and computational costs.

Information

Type
JFM Papers
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press.
Figure 0

Figure 1. Learning process in the PCDRL model. Each agent at each iteration step in the episode obtains a state from a point in the flow, calculates the reward and applies an action according to the policy.

Figure 1

Figure 2. Progress of the mean reward during the training process. Cases 1, 2 and 3 represent the noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 2

Figure 3. Action map of the prediction process for an instantaneous streamwise velocity field. The top panels show the types of filters used in the process and the action map in each iteration step, and the bottom panels show the corresponding velocity field. Results for the DNS noisy data at noise level $1/{SNR} = 0.1$.

Figure 3

Figure 4. (a) Instantaneous vorticity field; (b) relative $L_2$-norm error of the reconstructed velocity fields. Cases 1, 2 and 3 represent the noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 4

Figure 5. Probability density function plots of the (a) streamwise and (b) spanwise velocity components. Cases 1, 2 and 3 represent the noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 5

Figure 6. Scatter plots of the maximum instantaneous values of the (a) streamwise and (b) spanwise velocity components. Cases 1, 2 and 3 represent the results from the PCDRL model using noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively. The contour colours (from blue to red) are proportional to the density of points in the scatter plot.

Figure 6

Figure 7. Power spectral density plots of the streamwise velocity fluctuations at two different locations: (a) $(x/D,y/D) = (1,1)$ and (b) $(x/D,y/D) = (6,1)$. The dimensionless frequency is represented by the Strouhal number, $St=fD/u_{\infty }$, where $f$ is the frequency. Cases 1, 2 and 3 represent the noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 7

Figure 8. Spanwise profiles of flow statistics $u_{rms}$ (left column), $v_{rms}$ (middle column) and $\overline {u'v'}$ (right column) at two different streamwise locations: (a) $x/D = 3$; (b) $x/D = 6$. Cases 1, 2 and 3 represent the results from the PCDRL model using noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 8

Figure 9. (a) Instantaneous vorticity field of the noisy (left column) and denoised PIV data obtained from the PCDRL model (right column); (b) relative difference of the spanwise profile of the vorticity root mean square at two different streamwise locations.

Figure 9

Figure 10. Leading POD modes obtained from the DNS data. Results from the ground truth DNS (left column), PCDRL (middle column) and noisy data with $1/SNR = 1$ (right column).

Figure 10

Figure 11. Normalised energy (left column) and cumulative energy (right column) of the POD modes obtained from the DNS data: (a) noisy data, where Cases 1, 2 and 3 represent the noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively; (b) results from the PCDRL model.

Figure 11

Figure 12. Reconstructed instantaneous vorticity field obtained from the DNS data using the first ten POD modes. Cases 1, 2 and 3 represent the results of using noisy DNS data at noise levels $1/{SNR} = 0.01$, 0.1 and 1, respectively.

Figure 12

Figure 13. Leading POD modes obtained from the PIV data. Results from clear PIV (left column), PCDRL (middle column) and noisy PIV data (right column).

Figure 13

Figure 14. (a) Normalised energy and (b) cumulative energy of the POD modes obtained from the PIV data.

Figure 14

Figure 15. Reconstructed instantaneous vorticity field obtained from the results of the (a) PCDRL model and (b) noisy PIV data using the first ten POD modes.

Figure 15

Figure 16. (a) DMD eigenvalues of the noisy DNS data at noise level $1/SNR = 1$ and (b) the results from the PCDRL model visualised on the unit circle.

Figure 16

Figure 17. (a) DMD eigenvalues of the noisy PIV data and (b) the results from the PCDRL model visualised on the unit circle.

Figure 17

Figure 18. Architecture of the actor–critic algorithm.

Figure 18

Figure 19. Architecture of the fully convolutional A3C.

Figure 19

Table 1. Action set for the denoising process.