Hostname: page-component-89b8bd64d-x2lbr Total loading time: 0 Render date: 2026-05-05T19:21:03.189Z Has data issue: false hasContentIssue false

Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks

Published online by Cambridge University Press:  13 January 2026

Nima Zafarmomen
Affiliation:
Clemson University , USA
Vidya Samadi*
Affiliation:
Clemson University , USA
Edoardo Borgomeo
Affiliation:
University of Cambridge , UK
*
Corresponding author: Vidya Samadi; Email: samadi@clemson.edu
Rights & Permissions [Opens in a new window]

Abstract

Extreme weather events, combined with human-induced factors, such as expanding impervious surfaces and inadequate drainage infrastructure, are driving escalating urban flood risks worldwide. In this study, we present a novel spatiotemporal Long Short-Term Memory (LSTM)-based surrogate of the U.S. Environmental Protection Agency (EPA)’s Storm Water Management Model (SWMM) to predict maximum water depth and inflow at the asset level within urban drainage networks. The high-resolution SWMM model, encompassing the full network of conduits and manholes, was first calibrated and validated using U.S. Geological Survey (USGS) observations. The LSTM surrogate was then trained on data from 5,000 rainfall events across seven Annual Recurrence Intervals (ARIs) ranging from 1 to 100 years. The SWMM-LSTM surrogate model consistently achieves high predictive performance for both water depth and inflow, highlighting its robustness across diverse storm scenarios and ARI conditions. Hyperparameter optimization via grid search revealed task-specific configurations: larger hidden layers with moderate dropout improved water depth predictions, while deeper network architectures with minimal dropout optimized inflow forecasts. By providing rapid, computationally efficient predictions without compromising accuracy, the SWMM-LSTM surrogate offers a practical tool for real-time flood risk assessment, scenario evaluation and actionable decision-making in complex urban drainage systems.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. (a) Location of Columbia in South Carolina, USA. (b) Spatial configuration of the stormwater network and topography in the Rocky Branch Watershed, Columbia, South Carolina.

Figure 1

Table 1. Summary of Rocky Branch stormwater drainage network characteristics

Figure 2

Figure 2. SWMM–LSTM workflow for surrogate modeling. (a) SWMM process: rainfall is processed through infiltration, surface runoff, and dynamic-wave routing to produce node-level maxima (water depth and inflow), which are used as training targets; (b) Repository of event datasets containing rainfall time series and the corresponding SWMM targets; (c) Input sequence: r time steps by c predictors assembled per node event; and (d) to predict node-level depth and inflow in real time, while SWMM is used only offline to generate training labels.

Figure 3

Figure 3. Time series comparison for (a) 13–14 December 2019 and (b) 25 July 2024, showing the rainfall hyetograph (mm) and observed versus simulated discharge hydrographs (Cubic Meters per Second (CMS)), representing the SWMM calibration events.

Figure 4

Figure 4. Heatmap comparison of mean NSE values for predicting maximum water depth (top panel) and maximum inflow (bottom panel) using various LSTM hyperparameter configurations, including hidden size, number of layers and dropout rate.

Figure 5

Figure 5. (a) Training loss (MSE) and (b) training MAE over 200 training epochs for LSTM-based predictions of maximum water depth and maximum inflow.

Figure 6

Table 2. Performance of the LSTM-based surrogate model on the independent test set, evaluated across different ARIs

Figure 7

Figure 6. Boxplots of RMSE for LSTM-based predictions of maximum water depth (top) and maximum inflow (bottom) across seven ARI categories. Medians (green lines), means (green diamonds), interquartile ranges (boxes), data within 1.5× IQR (interquartile range; whiskers) and outliers (circles) are shown.

Figure 8

Figure 7. Spatial distribution of inter-event variability across junctions for (a) normalized standard deviation of maximum water depth and (b) normalized standard deviation of maximum inflow, computed over all 5,000 synthetic rainfall events. Junctions J937 and J3088 are also illustrated.

Figure 9

Figure 8. Statistical analysis of maximum water depth and inflow across ARIs at junctions J937 and J3088, showing all data points, mean, median, 5th percentile and 95th percentile across ARI categories (1, 2, 5, 10, 25, 50 and 100 years).

Supplementary material: File

Zafarmomen et al. supplementary material

Zafarmomen et al. supplementary material
Download Zafarmomen et al. supplementary material(File)
File 23.9 KB

Author comment: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R0/PR1

Comments

9/27/2025

Dear Prof. Fenner,

Editor in Chief of Cambridge Prisms: Water

On behalf of my coauthor, I wish to request your consideration of our research article titled “Spatiotemporal SWMM-LSTM Surrogate Modeling for Efficient Node-Level Water Depth and Inflow Prediction in Urban Drainage Networks” for publication in Cambridge Prisms: Water. This project is the result of a US National Science Foundation-funded project.

This study presents a novel hybrid modeling framework that integrates high-resolution, asset-level calibration of the EPA’s SWMM with spatiotemporal Long Short-Term Memory (LSTM) networks. We believe this work aligns closely with the journal’s focus on innovative water research and management strategies, particularly in urban flood risk modeling and smart water systems. Our study provides a scalable, adaptable framework that can inform resilient urban planning and climate-responsive infrastructure development.

We confirm that this manuscript has not been published elsewhere and is not under consideration by any other journal. All authors have approved the submission and declare no competing interests. Thank you very much for your consideration. We look forward to your feedback and hope that our work will contribute to advancing research in urban water management.

Sincerely,

Vidya Samadi, Ph.D., M.ASCE.

Assistant Professor & Director of Clemson Hydroinformatics Research Group,

Affiliate Faculty, Artificial Intelligence Research Institute for Science and Engineering (AIRISE), School of Computing Clemson University, SC, USA.

Review: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

This study proposes a hybrid spatiotemporal SWMM-LSTM framework that integrates the U.S. EPA’s Storm Water Management Model (SWMM) with a Long Short-Term Memory (LSTM) neural network for rapid and accurate flood prediction at the node (asset) level within urban drainage systems.

The model is applied to the Rocky Branch Watershed in Columbia, South Carolina—a flood-prone urban basin. A high-resolution SWMM model was calibrated using USGS observations and then used to generate synthetic rainfall-response datasets (5,000 events across seven ARIs from 1–100 years). The LSTM surrogate predicts maximum water depth and maximum inflow with high fidelity to SWMM results.

Results show:

NSE up to 0.97 for inflow and 0.92 for water depth.

RMSE values between 0.02–0.03 m³/s (inflow) and 0.08–0.10 m (depth).

Identification of hydraulically sensitive nodes through inter-event variability.

The framework substantially reduces computation time while maintaining high accuracy, supporting real-time decision-making in flood management.

The paper makes a significant contribution by moving beyond outlet-based predictions to node-level flood metrics. The coupling of a physically-based model with a deep learning surrogate is timely and well aligned with current hydroinformatics trends.

However, the manuscript would benefit from a more explicit quantitative comparison with previous hybrid frameworks (e.g., Zhao et al., 2023; Wu et al., 2025). Presenting performance and runtime differences in a summary table would clearly position the novelty and efficiency gains.

While model validation is solid, uncertainty quantification is limited. The study could briefly discuss:

How uncertainty in rainfall inputs or SWMM calibration parameters propagates to LSTM outputs.

Possible use of ensemble LSTM or dropout variance as uncertainty estimators.

Even a conceptual paragraph would strengthen the study’s rigor and transparency.

The paper frequently highlights the model’s “rapid computation,” but no runtime benchmarks are provided.

A concise comparison such as

“SWMM simulation = 3 hours per storm vs. LSTM = 0.5 seconds per storm”

would reinforce claims of real-time capability.

The generation of the 5,000 synthetic hyetographs is central to training. The authors mention they were “benchmarked against historical data,” but the method of synthesis (e.g., design storms, stochastic generator, scaling from observed events) is unclear.

Clarifying this process—or providing a short description of the generator and its validation—would improve reproducibility.

Figures are generally informative but can be enhanced:

Figure 4–6: Add clearer legends and unit labels; ensure consistent color scales for depth vs. inflow.

Figure 7–8: Consider larger font and emphasize node IDs (J937, J3088).

These minor adjustments will increase readability for interdisciplinary audiences.

Location Comment / Suggestion

p. 10 (§2.2) “SWMM employes” → “SWMM employs.”

Eq. 6 Replace “Rs is surface runoff” with “Rₛ denotes surface runoff.”

Fig. 2 caption “which use as training targets” → “which are used as training targets.”

p. 23 Add y-axis labels (“RMSE (m)” and “RMSE (CMS)”).

Throughout Ensure consistent units (m³/s, m) and use SI spacing conventions.

References Verify that all 2025 citations are in-press or available online; include DOIs where possible.

The manuscript is scientifically sound, well-written, and contributes valuable insights into physics-informed deep learning for hydrology. Required revisions are limited to improving contextual comparisons, figure clarity, and adding brief discussions of uncertainty and computational performance. Once addressed, the paper is fully suitable for publication in Cambridge Prisms: Water.

please consider this reference for improving the paper:

Samela, C., Manfreda, S., De Paola, F., Giugni, M., Sole, A., & Fiorentino, M. (2016). DEM-based approaches for the delineation of flood-prone areas in an ungauged basin in Africa. Journal of Hydrologic Engineering, 21(2), 06015010.

Manfreda, S., Samela, C., Gioia, A., Consoli, G. G., Iacobellis, V., Giuzio, L., & De Paola, F. (2019). A digital elevation model based method for a rapid estimation of flood inundation depth. Journal of Flood Risk Management, 12(S1), e12541.

Review: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

The manuscript “WAT-2025-0029” presents a study that develops an LSTM-based surrogate model trained on SWMM-generated data to predict maximum water depth and inflow across an urban drainage network. While the topic is relevant and timely, leveraging data-driven models to enhance urban hydrodynamic simulations, several aspects of the methodology, model structure, and data description remain ambiguous or insufficiently justified. The following comments and questions are intended to improve the clarity, reproducibility, and methodological soundness of the study.

Major Comments:

1- The description of the surrogate model (Section 2.3.1 and Figure 2) is somewhat unclear regarding the exact input–output configuration and its operational intent. From the current explanation, it seems that the LSTM model receives rainfall time series as input and predicts the maximum water depth and maximum inflow for each drainage node or junction. If this interpretation is correct, it implies that during real-time operation, once a rainfall event begins, the model could be driven by a sequence of observed or forecast rainfall to predict the evolving or final (maximum) hydraulic states. However, this functionality is not explicitly discussed or demonstrated in the manuscript.

Please clarify the following points:

• Are the LSTM inputs the full rainfall time series of each synthetic event, and are the outputs the event-level maximum for each node?

• If so, how does the model generalize about unseen rainfall events in real time? Does it require the full event to be known a priori, or can it produce predictions incrementally as rainfall data arrive (which would justify using a temporal model like LSTM)?

• If the model only predicts event-level maxima (one value per node per event), then using an LSTM (a sequence model) might not be necessary, since no sequential output or evolving state is involved. In that case, a simpler regression or feed-forward neural network could suffice.

More broadly, discuss the practical real-time applicability of the surrogate: whether it can be used operationally during storms, or if it only serves as an offline emulator of SWMM outputs for scenario screening. This distinction is critical for understanding the value and novelty of the proposed hybrid framework.

2- The manuscript mentions that 5,000 synthetic rainfall events were generated across seven Average Recurrence Intervals (ARIs) and benchmarked against historical observations (Section 2.3.1). However, the method used to produce these synthetic hyetographs is not described in sufficient detail to ensure reproducibility or to assess the representativeness of the training data. What approach or model was used to generate synthetic rainfall events? (e.g., design storm method based on IDF curves, stochastic rainfall generator, scaling of observed events, or other approaches). What storm durations, temporal distributions, and total depths correspond to each ARI category? Were the hyetographs spatially uniform or spatialy distributed across the watershed? How was “benchmarking against historical observations” quantitatively performed? Since the surrogate model’s predictive performance and generalizability depend heavily on the diversity and realism of these rainfall events, a more transparent explanation of the synthetic data generation procedure is essential.

3- The manuscript does not clearly describe how the dataset was partitioned into training, validation, and test subsets, nor does it report the model’s performance for each set. While Section 2.3.2 discusses hyperparameter tuning and Figure 5 shows training loss curves, there is no quantitative information on validation or testing outcomes. Table 2 summarizes performance across ARIs but does not specify whether these resuls correspond to the training or testing phase. How were the 5,000 rainfall events divided among training, validation, and testing subsets? Were the events split randomly or stratified by ARI categories? What performance metrics (NSE, RMSE, MAE) were achieved on the independent test set? Was any cross-validation or k-fold strategy used to ensure robustness and avoid overfitting?

4- The concept of “inter-event variability” is highlighted as a key contribution, but the analysis is limited. While Equation 10 and Figures 7–8 show normalized variability, the discussion does not explain the physical meaning, drivers, or practical implications. Please elaborate on how inter-event variability identifies hydraulic hotspots and how it can inform flood management decisions.

5- The manuscript inconsistently describes the relationship between SWMM and the LSTM model. In some sections (e.g., Abstract and Impact Statement), the framework is described as an integration of SWMM and LSTM, suggesting dynamic coupling. In other sections (e.g., Methodology, Figure 2), the LSTM is presented as a surrogate model, trained on SWMM outputs to emulate its behavior. The LSTM operates in tandem with SWMM (i.e., a coupled hybrid model), or it functions as an independent surrogate that replaces SWMM after training.

6- The calibration section (Section 3.1 and Figure 3) lacks sufficient information regarding the rainfall events used. The manuscript mentions two events (12–13 December 2019 and 25 July 2024) but does not explain why these specific storms were chosen, whether they represent extreme floods, typical events, or seasonal contrasts. Please provide the selection criteria and basic statistics for each event (total rainfall, duration, and peak intensity) to justify their representativeness for calibration.

In addition, the rainfall hyetograph in Figure 3 appears visually overlapping and difficult to interpret. Clarifying the temporal resolution of both rainfall and discharge data (e.g., 15-min, hourly) and improving the figure’s readability (thinner bars or separated panels) would enhance interpretability.

Minor Comments:

1- The manuscript does not provide sufficient information on the 5,000 synthetic rainfall events used to train the surrogate model. While seven ARI categories are mentioned, there is no table or figure showing their corresponding rainfall duration, total depth, or temporal patterns. Without this information, it is difficult to evaluate whether the generated hyetographs realistically represent the hydrologic variability of observed storms.

2- The manuscript lacks sufficient detail regarding the data used for both model calibration and training the surrogate model. While the USGS gauge 02169505 is mentioned for calibration and the City of Columbia GIS portal is acknowledged, the sources and processing of rainfall and network data are not clearly explained.

3- The case study description refers to the Rocky Branch Watershed as a single catchment, yet Section 2.2 indicates that the SWMM setup includes 2,802 manholes and 2,801 conduits, implying a subdivision into multiple subcatchments. However, the manuscript does not clarify how these subcatchments were defined or how rainfall inputs were assigned across them. Was rainfall inputs spatially uniform for the entire watershed or applied separately to each subcatchment? How many subcatchments were modeled, and what were their average sizes or key characteristics?How does this spatial structure translate into the LSTM model inputs and outputs (e.g., one rainfall input for all nodes or distinct rainfall–response pairs per subcatchment)?

4- In Page 9 (Lines 164–168), the manuscript discusses the PySWMM library and its capability to run and control SWMM within a Python environment. However, it is not clear whether PySWMM was used in this study or only mentioned as a potential tool.

5- In Figure 2 and its caption, it is clearly stated that “SWMM is used only offline to generate training labels,” indicating that the LSTM functions as a surrogate model, not an integrated or dynamically coupled system. This reinforces that the repeated references to an “integrated SWMM–LSTM framework” throughout the manuscript are conceptually inaccurate and should be revised for consistency. Additionally, in Figure 2, the direction of the arrow between panels (a) and (b) appears to be reversed. Since rainfall time series serve as input to SWMM, which then produces node-level outputs (maximum water depth and inflow), the correct information flow should be from (a) SWMM → (b) dataset repository. Please review and correct the figure accordingly to avoid confusion.

6- The manuscript does not clearly specify the temporal resolution of the rainfall and streamflow data used in the SWMM calibration and evaluation. While the synthetic rainfall events for LSTM training are noted as 15-minute resolution, the time step of observed rainfall and USGS discharge data used for calibration (Section 3.1) is not provided.

7-Figure 7 displays the spatial distribution of “inter-event variability,” but the information provided is insufficient to interpret the results. The caption and text do not specify which rainfall event(s) were used, nor do they explain what the node colors represent quantitatively. From Equation (10), it appears that the variability is computed across all events, yet this should be stated explicitly.

8- Figure 8 presents inter-event variability for two junctions, maximum water depth at J937 and maximum inflow at J3088, but the manuscript does not explain why different response variables were chosen for each node. It would be helpful to clarify: The rationale for selecting these two specific junctions (e.g., based on network location, sensitivity ranking, or flow accumulation). Why was water depth analyzed for one and inflow for the others reflect different hydraulic behaviors (e.g., ponding vs. conveyance sensitivity)? Whether these nodes represent typical or extreme variability cases across the system.

9- Page 18 (~L335) reports “hidden size 26” for the optimal inflow configuration. This appears to be a typo (likely 256). In Table 2 (and related text in Section 3.3), the RMSE and MAE values are reported without units.

Recommendation: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R0/PR4

Comments

Reviewers were rather supportive and the manuscript could represent a valuable contribution worthy of being published in Cambridge Prism: Water. Indeed, Reviewers provided several comments that could be useful to improve the manuscript. In particular, the description of many aspects of the method, such as generation of rainfall input, creation of the data sets used to train and test the method, should be enhanced in order to improve the reproducibility. Pro and cons of the method, including applicability for real time management and uncertainty characterization should be discussed and novel contributions properly highlighted. Authors are thus kindly asked to address all the comments and provide detailed replies. The decision on publication of this paper is deferred until the authors are able to revise and resubmit the paper.

Decision: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R0/PR5

Comments

No accompanying comment.

Author comment: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R1/PR6

Comments

No accompanying comment.

Review: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R1/PR7

Conflict of interest statement

No ones.

Comments

The paper is suitable for the publication. Please check English grammar.

Review: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R1/PR8

Conflict of interest statement

Non

Comments

Overall, the authors have addressed the majority of my comments, and the revised manuscript is improved in terms of clarity, methodological transparency, and consistency. I have two remaining comments for the authors’ consideration:

(1) The authors clarify that the LSTM surrogate can be driven incrementally as rainfall data arrive to update predictions of event-level peak depth and inflow. However, the model is trained using complete rainfall sequences with supervision applied only at the event level (i.e., final maxima), rather than with time-step–resolved or truncated-sequence targets. Consequently, incremental predictions during an ongoing storm represent extrapolations from partial inputs rather than explicitly learned time-evolving forecasts. To avoid potential overstatement of operational real-time capability, the manuscript would benefit from clearly distinguishing between incremental inference enabled by the model architecture and the current training strategy. Alternatively, if the authors intend to claim true incremental predictive capability, this could be supported by additional results demonstrating model behavior when driven by partial rainfall sequences.

(2) While reporting test-set performance is appropriate and the loss curves in Figure 5 provide useful insight into convergence and overfitting behavior, the manuscript does not report quantitative performance metrics (e.g., NSE, RMSE, MAE) for the training and validation sets. Providing a summary of these metrics, either in a supplementary table or briefly in the main text, would further support the assessment of model generalization and complement the loss-based diagnostics.

Recommendation: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R1/PR9

Comments

Authors properly addressed reviewers' comments. The manuscript is almost ready for publication provided that Authors address last minor comments of a reviewer

Decision: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R1/PR10

Comments

No accompanying comment.

Author comment: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R2/PR11

Comments

No accompanying comment.

Recommendation: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R2/PR12

Comments

Authors properly addressed last minor comments. The manuscript can be accepted for publication. Congratulations

Decision: Spatiotemporal SWMM-LSTM surrogate modeling for efficient node-level water depth and inflow prediction in urban drainage networks — R2/PR13

Comments

No accompanying comment.