Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-03-28T08:20:48.479Z Has data issue: false hasContentIssue false

FloodGNN-GRU: a spatio-temporal graph neural network for flood prediction

Published online by Cambridge University Press:  08 November 2024

Arnold Kazadi
Affiliation:
Department of Computer Science, Rice University, Houston, TX, USA
James Doss-Gollin
Affiliation:
Department of Civil and Environmental Engineering, Rice University, Houston, TX, USA
Antonia Sebastian
Affiliation:
Department of Earth, Marine and Environmental Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Arlei Silva*
Affiliation:
Department of Computer Science, Rice University, Houston, TX, USA
*
Corresponding author: Arlei Silva; Email: arlei@rice.edu

Abstract

Classical approaches for flood prediction apply numerical methods for the solution of partial differential equations that capture the physics of inundation processes (e.g., the 2D Shallow Water equations). However, traditional inundation models are still unable to satisfy the requirements of many relevant applications, including early-warning systems, high-resolution (or large spatial domain) simulations, and robust inference over distributions of inputs (e.g., rainfall events). Machine learning (ML) approaches are a promising alternative to physics-based models due to their ability to efficiently capture correlations between relevant inputs and outputs in a data-driven fashion. In particular, once trained, ML models can be tested/deployed much more efficiently than classical approaches. Yet, few ML-based solutions for spatio-temporal flood prediction have been developed, and their reliability/accuracy is poorly understood. In this paper, we propose FloodGNN-GRU, a spatio-temporal flood prediction model that combines a graph neural network (GNN) and a gated recurrent unit (GRU) architecture. Compared to existing approaches, FloodGNN-GRU (i) employs a graph-based model (GNN); (ii) operates on both spatial and temporal dimensions; and (iii) processes the water flow velocities as vector features, instead of scalar features. We evaluate FloodGNN-GRU using a LISFLOOD-FP simulation of Hurricane Harvey (2017) in Houston, Texas. Our results, based on several metrics, show that FloodGNN-GRU outperforms several data-driven alternatives in terms of accuracy. Moreover, our approach can be trained 100x faster and tested 1000x faster than the time required to run a comparable simulation. These findings illustrate the potential of ML-based methods to efficiently emulate physics-based inundation models, especially for short-term predictions.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. (a) On a general graph, each node/cell $ i $ has an inflow with an in-velocity $ {\mathbf{a}}_i $ and outflow with an out-velocity $ {\mathbf{b}}_i $. We note that water can enter and exit the cell in any direction and $ {\mathbf{a}}_i $ and $ {\mathbf{b}}_i $ will differ depending on the properties of node $ i $ (e.g., friction and elevation). (b) the same vectors can be represented on a staggered grid using (left and top) cell interfaces as a basis for in-velocity $ {\mathbf{a}}_i $ and (right and bottom) cell interfaces as a basis for out-velocity. This convention is applied by LISFLOOD-FP (Bates et al., 2013).

Figure 1

Table 1. Input features. Pre-processing means that the data were calculated from other features

Figure 2

Figure 2. Spatial distribution of static features in our dataset.

Figure 3

Figure 3. Overview of FloodGNN-GRU. At each time $ t $, the region $ {R}_g $ is in state $ {R}_g^t $ with scalar features $ {\mathbf{s}}_i^t $ and vector features $ {\mathbf{V}}_i^t $ for each node/cell $ {v}_i $. These are processed through a FloodGNN-GRU to produce hidden state $ {H}^t $ that captures both spatial and temporal information on the dynamics of a flooding event. $ {H}^t $ is later used for the estimation of the next water depth $ {\tilde{w}}_i^{t+1} $ and velocities $ {\tilde{\mathbf{a}}}_i^{t+1} $ and $ {\tilde{\mathbf{b}}}_i^{t+1} $. The L1 loss function between $ {\tilde{w}}_i^{t+1} $, $ {\tilde{\mathbf{a}}}_i^{t+1} $, $ {\tilde{\mathbf{b}}}_i^{t+1} $ and their ground truth values $ {w}_i^{t+1} $, $ {\mathbf{a}}_i^{t+1} $, $ {\mathbf{b}}_i^{t+1} $ is used for parameter learning in our model.

Figure 4

Figure 4. (a) Digital elevation model (DEM) of Houston, Texas. (b) Generation of non-overlapping sub-regions and illustrative example of the split of these sub-regions into training and test sets.

Figure 5

Table 2. Predictions time-step of size 1-h

Figure 6

Table 3. RSME on the velocity predictions in terms of mean and standard deviation values over 3 random, independent experiments

Figure 7

Figure 5. Predictions from time $ t=1 $ to $ t=20 $ (in hours) with 1-hour time intervals. Each solid line represents the mean over three experiments, and the shadows along the solid lines represent standard deviations. The results show that FloodGNN-GRU achieves the best results compared to other methods.

Figure 8

Figure 6. Scatter plot of water depths in log–log scale at time $ t=1 $. FloodGNN-GRU produces predictions that are the most aligned with true values. Pearson’s coefficient of correlation values are given at the bottom of each plot.

Figure 9

Figure 7. Scatter plot of water depths in log–log scale at time $ t=5 $. FloodGNN-GRU produces predictions that are the most aligned with true values. Pearson’s coefficient of correlation values are given at the bottom of each plot.

Figure 10

Figure 8. Scatter plot of water depths in log–log scale at time $ t=10 $. FloodGNN-GRU produces predictions that are the most aligned with true values. Pearson’s coefficient of correlation values are given at the bottom of each plot.

Figure 11

Figure 9. True water depth in meters (Column 1) compared to FloodGNN-GRU predictions (Column 2), and the water depth difference map (Column 3). Our approach can localize the flood (i.e., locations with larger water depths), but the extent of the flood is often underestimated, which is consistent with the results from Figures 6 to8. Note that the color scales are different for each time to better appreciate the difference between the target and prediction values at the different water depth levels.

Figure 12

Table 4. Computation times

Figure 13

Figure 10. Performance when FloodGNN-GRU is trained with 2 FloodGNN layers instead of 1, and when it is trained with $ {L}_2 $ loss instead of $ {L}_1 $.

Figure 14

Figure 11. Performance when the GRU module of FloodGNN-GRU is removed.