Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-09T03:06:02.636Z Has data issue: false hasContentIssue false

Drag prediction of rough-wall turbulent flow using data-driven regression

Published online by Cambridge University Press:  19 February 2025

Zhaoyu Shi
Affiliation:
FLOW, Department of Engineering Mechanics, KTH, Stockholm 10044, Sweden
Seyed Morteza Habibi Khorasani
Affiliation:
FLOW, Department of Engineering Mechanics, KTH, Stockholm 10044, Sweden
Heesoo Shin
Affiliation:
Mechanical Engineering Department, Inha University, Incheon 22212, Republic of Korea
Jiasheng Yang
Affiliation:
Institute of Fluid Mechanics, Karlsruhe Institute of Technology, Karlsruhe 76131, Germany
Sangseung Lee
Affiliation:
Mechanical Engineering Department, Inha University, Incheon 22212, Republic of Korea
Shervin Bagheri*
Affiliation:
FLOW, Department of Engineering Mechanics, KTH, Stockholm 10044, Sweden
*
*Corresponding author. E-mail: shervin@mech.kth.se

Abstract

Efficient tools for predicting the drag of rough walls in turbulent flows would have a tremendous impact. However, accurate methods for drag prediction rely on experiments or numerical simulations which are costly and time consuming. Data-driven regression methods have the potential to provide a prediction that is accurate and fast. We assess the performance and limitations of linear regression, kernel methods and neural networks for drag prediction using a database of 1000 homogeneous rough surfaces. Model performance is evaluated using the roughness function obtained at a friction Reynolds number $Re_\tau$ of 500. With two trainable parameters, the kernel method can fully account for nonlinear relations between the roughness function $\Delta U^+$ and surface statistics (roughness height, effective slope, skewness, etc.). In contrast, linear regression cannot account for nonlinear correlations and displays large errors and high uncertainty. Multilayer perceptron and convolutional neural networks demonstrate performance on par with the kernel method but have orders of magnitude more trainable parameters. For the current database size, the networks’ capacity cannot be fully exploited, resulting in reduced generalizability and reliability. Our study provides insight into the appropriateness of different regression models for drag prediction. We also discuss the remaining steps before data-driven methods emerge as useful tools in applications.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Examples of the five roughness types: the three-dimensional topography of each type and their two-dimensional projections on the $x$$z$ plane are shown in the leftmost column. The number of samples of each type used in this study is given. The sample-averaged skewness and kurtosis $\langle \rangle$ are provided to demonstrate whether a surface is Gaussian or not in terms of its height distribution. Anisotropy is examined by the mean ratio of effective slopes over the samples in two directions. ES, effective slope in the x or z direction.

Figure 1

Table 2. The topographical statistics include ten ‘primary’ parameters and nine ‘pair’ parameters. The main features are divided into the ones bearing physical implications, i.e. crest height $k_c$, average height deviation $R_a$, effective slopes $ES_{x,z}$, porosity $Po$, inclinations $inc_{x,z}$; and statistical parameters, i.e. root-mean-square height $k_{rms}$, skewness $Skw$ and kurtosis $Ku$.

Figure 2

Figure 1. Scatter distributions of $\Delta U^+$ and four representative statistics of each type of roughness: (a) $k^+_{rms}$, (b) $Skw$, (c) $ES_x$ and (d) $Ku$. The dashed straight lines in (a) highlight the linear relationship between $\Delta U^+$ and $k^+_{rms}$ for merely zero and positively skewed surfaces while the ‘cluster’ distribution (circle lines) in (d) indicates a nonlinear relationship between $\Delta U^+$ and $Ku$ prediction.

Figure 3

Figure 2. Correlation coefficients $\rho$ of ten primary parameters and $\Delta U^+$ for each type of roughness. The circles in the bottom row show the linear correlation between $\Delta U^+$ and the parameters while the rest are the correlations between any two topographical parameters. Larger and darker circles represent stronger linear correlation between two variables. Those with $|\rho _{ij}|>0.5$ are annotated.

Figure 4

Figure 3. Workflow of drag prediction. The four models are evaluated by MAE, MAPE and $R^2$. The model architectures of MLP and CNN are illustrated, wherein the hyperparameters (HPs) are determined using Bayesian optimization.

Figure 5

Figure 4. The $\Delta U^+$ predictions of LR versus those from DNS: model using (a) 10 primary statistics and (b) 19 statistics (i.e. including 9 pair-product parameters.)

Figure 6

Table 3. The Bayesian-optimized HPs in $MLP_{10}$, $MLP_{19}$ and CNN that are used for prediction in this work. ReLU, rectified linear unit.

Figure 7

Figure 5. Loss curves of training and validation in the Bayesian-optimized (a) $MLP_{10}$ and (b) CNN with leaning rate reschedule. Early stopping was employed during the neural network within the BO process to mitigate overfitting and expedite training.

Figure 8

Figure 6. (a) The sample coverage in $\Delta U^+-k^+_{rms}$ space at the fraction of 30 %. The reduced training samples consistently cover the full parameter space. (b) The MAPE of inference obtained from $LR_{10}$, $MLP_{10}$, $SVR_{10}$ and CNN at different sample fractions. (c) The variation in the number of trainable parameters in each model at different sample fractions.

Figure 9

Figure 7. (a) Values of MAPE (%) (blue) and MAE (red) obtained from all models trained by the hybrid data. Left and right symbols correspond to 10 and 19 input parameters. All maximum errors correspond to negatively skewed surfaces (type: $Sk_-$). (b) Applying the trained model by the full dataset on each type of surface thus the corresponding mean errors. The data are slightly displaced on the horizontal axis for the same type of roughness to increase clarity.

Figure 10

Figure 8. The scatter distribution of $\Delta U^+$ vs $\Delta \tilde U^+$ obtained from the new SVR. The model is trained with the reduced input space involving (a) $k^+_{rms}$, $ES_x$, $ES_z$ (b) and additional $Skw$. The error reduction is observed for $Sk_-$- and $Sk_+$-roughness, marked in bold.

Figure 11

Figure 9. The average errors of each roughness category (empty circles) obtained from the three models $(LR_4, SVR_4, MLP_4)$ trained by the reduced input $\boldsymbol {x}=(k^+_{rms}, ES_x, ES_z, Skw)$. The solid crosses represent the mean error of all surfaces.

Figure 12

Figure 10. Height map of the irregular rough surface from Jelly & Busse (2019). Colour bar values indicate the surface height relative to the mean reference plane, i.e. $k/\delta$.

Figure 13

Table 4. Simulation parameters for the validation case of the irregular rough surface of figure 10: friction Reynolds number ($Re_{\tau }$); domain lengths along streamwise ($L_x$), wall-normal ($L_y$) and spanwise ($L_z$) directions; viscous-scaled streamwise grid spacing ($\Delta x^+$), minimum wall-normal grid spacing ($\Delta y^+_{min}$); maximum wall-normal grid spacing($\Delta y^+_{max}$); spanwise grid spacing ($\Delta z^+$). Additional details and descriptions can be found in Jelly & Busse (2019).

Figure 14

Figure 11. Validation results: (a) mean velocity; (b) root-mean-square velocity fluctuations; (c) Reynolds shear stress. Lines are the results using CaNS and symbols are the data from Jelly & Busse (2019).

Figure 15

Table 5. The range of friction-scaled primary parameters of five types of roughness.