Hostname: page-component-77f85d65b8-lfk5g Total loading time: 0 Render date: 2026-04-22T09:50:54.507Z Has data issue: false hasContentIssue false

MLP-mixer-based deep learning network for pedestrian-level wind assessment

Published online by Cambridge University Press:  02 January 2025

Adam Clarke*
Affiliation:
Centre for Defence Engineering, Cranfield University, Defence Academy of the UK, Shrivenham, United Kingdom
Knut Erik Teigen Giljarhus
Affiliation:
Department of Mechanical and Structural Engineering and Materials Science, University of Stavanger, Stavanger, Norway Nablaflow AS, Stavanger, Norway
Luca Oggiano
Affiliation:
Nablaflow AS, Stavanger, Norway
Alistair Saddington
Affiliation:
Centre for Defence Engineering, Cranfield University, Defence Academy of the UK, Shrivenham, United Kingdom
Karthik Depuru-Mohan
Affiliation:
Centre for Defence Engineering, Cranfield University, Defence Academy of the UK, Shrivenham, United Kingdom
*
Corresponding author: Adam Clarke; Email: adam.p.clarke@cranfield.ac.uk

Abstract

This article addresses the challenges of assessing pedestrian-level wind conditions in urban environments using a deep learning approach. The influence of large buildings on urban wind patterns has significant implications for thermal comfort, pollutant transport, pedestrian safety, and energy usage. Traditional methods, such as wind tunnel testing, are time-consuming and costly, leading to a growing interest in computational methods like computational fluid dynamics (CFD) simulations. However, CFD still requires a significant time investment for such studies, limiting the available time for design modification prior to lockdown. This study proposes a deep learning surrogate model based on a MLP-mixer architecture to predict mean flow conditions for complex arrays of buildings. The model is trained on a diverse dataset of synthetic geometries and corresponding CFD simulations, demonstrating its effectiveness in capturing intricate wind dynamics. The article discusses the model architecture and data preparation and evaluates its performance qualitatively and quantitatively. Results show promising capabilities in replicating key wind features with a mean error of 0.3 m/s and rarely exceeding 0.75 m/s, making the proposed model a valuable tool for early-stage urban wind modelling.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-ShareAlike licence (http://creativecommons.org/licenses/by-sa/4.0), which permits re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Training Data Overview: (a) 3D model of a synthetic geometry used to generate training data for the deep learning model. (b) Processed geometry, a 2D representation of the synthetic geometry after preprocessing, where pixel colour corresponds to the height of the buildings. (c) Postprocessed CFD Data where the RGB channels encode the velocity components in the x, y, and z directions. This image provides a visual representation of the ground truth used for training and evaluating the deep learning model.

Figure 1

Figure 2. A schematic representation of the proposed modified image-to-image mixer model adapted from (Mansour et al., 2023). Details of the mixing layers are shown underneath.

Figure 2

Figure 3. A qualitative comparison between the predicted flow field generated by the deep learning model and the corresponding CFD simulation. Highlighted areas pinpoint instances where the model successfully replicates essential features of the wind flow, providing valuable insights into its performance.

Figure 3

Figure 4. Error plots depicting the difference between the predicted magnitude generated by the deep learning model and the simulated magnitude from CFD for the entire image and the centre section. The magnitude difference is normalised by the reference velocity at a 2 m height.

Figure 4

Table 1. Performance comparison between the standard MLP mixer and the modified version measured on the test set. Mean absolute errors and the 90th percentile absolute errors for the entire image and the centre section, excluding building or masked corner pixels, provide insights into the accuracy and robustness of each model in predicting pedestrian-level wind conditions. Lower errors indicate superior performance.

Figure 5

Figure 5. A side-by-side comparison of the predicted flow fields produced by the standard mixer model (a) and the proposed modified version (b).

Figure 6

Table 2. Top: Influence of training set size on model performance. Bottom: Influence of model size on performance

Author comment: MLP-mixer-based deep learning network for pedestrian-level wind assessment — R0/PR1

Comments

This article is part of the Climate Informatics 2024 proceedings and was accepted in Environmental Data Science on the basis of the Climate Informatics peer review process.

Review: MLP-mixer-based deep learning network for pedestrian-level wind assessment — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

>Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

The paper addresses the problem of pedestrian-level wind assessment, which aims at measuring the impact of urban design on wind flows in cities. Current methods are based on costly Computational Fluid Dynamics (CFD) simulations, which slows down the development cycle of urban designers. The paper proposes to learn a surrogate model with deep learning to approximate the mean flow conditions (simulated with RANS) for complex arrays of buildings.

>Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

The paper is mostly relevant for urban designers, but is only remotely relevant for climate informatics. It is claimed that the proposed method will help “”understanding and mitigating the impact of urban structures on micro-scale wind patterns“”, yet the proposed method is not directly related to climate. “”Energy generation“” is mentioned as an application, which I guess refers to wind turbines and could be relevant for mitigating climate change (although urban wind turbines cannot account for a significant amount of decarbonized energy generation).

I think the most relevant aspect of the paper for climate informatics is the learning of a surrogate model for a CFD simulation, which could be relevant to learn a surrogate model for GCMs or weather models.

>Detailed Comments

strengths:

- the idea of training a surrogate model for CFD is interesting.

- the dataset seems well prepared.

weaknesses:

The major weakness is the lack of quantitative evaluation:

- The quantitative evaluation is very limited and does not give many insights. What is the influence of model size ? architecture choice ? dataset size ? image resolution ?

- It is strange that the paper only uses a MLP-mixer model, quite uncommon in the litterature, showing in the end that “”neighbourhood mixing“” blocks (implemented with convolutional layers) are needed. Why not directly use a standard convolutional U-Net model / transformer-based model like Swin ?

- there is a strong focus on feature engineering, but it does not benefit the model.

Details:

- L07 “”Unlike a Convolutional Neural Network, the receptive field is not limited by the size of the convolutional filters, allowing for transfer of information across the full extent of the image“” -> In CNNs, the composition of may convolutional layers allows the receptive field to cover the whole image. If the authors want to show that the MLP mixer better transfer information than CNNs, quantitative comparison is needed.

- The paper claims that the architecture is based on attention (as emphasized in the title), yet the MLP blocks in the base architecture (image to image MLP-mixer) are not considered an attention mechanism. “”Attention“” is reserved to the key-query-value mechanism with a softmax, from which we can visualize attention maps (between 0 and 1), something that would have been interesting to see the paper if it included attention.

- lacking a section of inference speed (trained model versus CFD)

- missing some details (e.g. parameter count, number of layers)

- typo L30 “”in the is“”

Overall, while the problem setting is interesting, the machine learning treatment requires revision to be more informative, especially given that there is no other application than reproducing the RANS simulation.

Recommendation: MLP-mixer-based deep learning network for pedestrian-level wind assessment — R0/PR3

Comments

This article was accepted into Climate Informatics 2024 Conference after the authors addressed the comments in the reviews provided. It has been accepted for publication in Environmental Data Science on the strength of the Climate Informatics Review Process.

Decision: MLP-mixer-based deep learning network for pedestrian-level wind assessment — R0/PR4

Comments

No accompanying comment.