Hostname: page-component-77f85d65b8-zzw9c Total loading time: 0 Render date: 2026-03-29T05:42:02.146Z Has data issue: false hasContentIssue false

Modeling snow on sea ice using physics-guided machine learning

Published online by Cambridge University Press:  02 January 2025

Ayush Prasad*
Affiliation:
Earth Observation Research, Finnish Meteorological Institute, Helsinki, Finland
Ioanna Merkouriadi
Affiliation:
Earth Observation Research, Finnish Meteorological Institute, Helsinki, Finland
Aleksi Nummelin
Affiliation:
Marine Research, Finnish Meteorological Institute, Helsinki, Finland
*
Corresponding author: Ayush Prasad; Email: ayush.prasad@fmi.fi

Abstract

Snow is a crucial element of the sea ice system, affecting the sea ice growth and decay due to its low thermal conductivity and high albedo. Despite its importance, present-day climate models have a very idealized representation of snow, often including just one-layer thermodynamics, omitting several processes that shape its properties. Even though sophisticated snow process models exist, they tend to be excluded in climate modeling due to their prohibitive computational costs. For example, SnowModel is a numerical snow process model developed to simulate the evolution of snow depth and density, blowing snow redistribution and sublimation, snow grain size, and thermal conductivity in a spatially distributed, multilayer snowpack framework. SnowModel can simulate snow distributions on sea ice floes in high spatial (1-m horizontal grid) and temporal (1-hour time step) resolution. However, for simulations spanning over large regions, such as the Arctic Ocean, high-resolution runs face challenges of slow processing speeds and the need for large computational resources. To address these common issues in high-resolution numerical modeling, data-driven emulators are often used. However, these emulators have their caveats, primarily a lack of generalizability and inconsistency with physical laws. In our study, we address these challenges by using a physics-guided approach in developing our emulator. By integrating physical laws that govern changes in snow density due to compaction, we aim to create an emulator that is efficient while also adhering to essential physical principles. We evaluated this approach by comparing three machine learning models: long short-term memory (LSTM), physics-guided LSTM, and Random Forest, across five distinct Arctic regions. Our evaluations indicate that all models achieved high accuracy, with the physics-guided LSTM model demonstrating the most promising results in terms of accuracy and generalizability. Our approach offers a computationally faster way to emulate the SnowModel with high fidelity and a speedup of over 9000 times.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Arctic study regions.

Figure 1

Table 1. Variables driving SnowModel and inputs for emulators

Figure 2

Table 2. Comparison of RMSE across the study regions for different emulator models. Lower values are better

Figure 3

Figure 2. Predicted average snow density across all the points in Region 3 by emulator compared with the snow density from SnowModel.

Figure 4

Figure 3. Spatial maps showing snow density as predicted by the emulator and compared with those from the SnowModel for Region 2. The third column shows the error percentage (lower values are better), highlighting the differences between the SnowModel and the emulator.

Figure 5

Table 3. Comparison of runtime between SnowModel and emulators. Lower values are better

Figure 6

Figure 4. Feature importance derived from permutation shuffling in the physics-guided LSTM.

Author comment: Modeling snow on sea ice using physics-guided machine learning — R0/PR1

Comments

Dear Environmental Data Science editors,

This is our manuscript which was accepted and presented at Climate Informatics 2024 and accepted for publication in Environmental Data Science. In this work, we develop a Physics-guided Machine learning model to emulate SnowModel in the Arctic.

Review: Modeling snow on sea ice using physics-guided machine learning — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

>Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

This paper demonstrates the use of physics-informed machine learning (ML) to predict snow density on sea ice given meteorological and topographic features across the Arctic as predictors, finding that physics-informed ML methods outperform vanilla ML methods and climatological baselines. More specifically, the authors address an emulation task in which a computationally expensive dynamical snow model, namely SnowModel, is run on meteorological forcing variables comprising precipitation, air temperature, wind speed, relative humidity, day of year, and topography to compute snow density, snow depth, and snow temperature to produce training data for the 2010-2020 period. The goal is then to learn an ML model which emulates SnowModel in predicting snow density, snow depth, and snow temperature conditional on meteorological forcing and topography at a lower computational cost. Through a series of experiments comparing physics-guided LSTM networks, vanilla LSTM networks, random forests, and climatological baselines across five representative sites sampled across the Arctic, the authors find that physics-guided LSTM networks achieve the highest performance by constraining the predicted snow density, snow depth, and snow temperature through known physical relationships. A runtime analysis indicates that the proposed physics-informed LSTM is 9931 times faster than the dynamical SnowModel, while a feature importance analysis indicates that air temperature is the most influential variable on predicted snow density.

>Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

Yes, reducing the computational burden of climate science is a key strategic priority across all subdisciplines of the field, and this work’s ability to successfully emulate an expensive dynamical model using physics-informed ML methods will help to inform and advance other emulation studies. The physical phenomenon of snow on sea ice is also of significant interest in the polar science community given the role it plays in mediating surface albedo, thermodynamic exchange, and ocean-atmosphere fluxes more generally, hence this work is a welcome contribution from the domain science point of view.

>Detailed Comments

Overall, the paper reads well, and I wholeheartedly recommend acceptance once the following points are addressed:

1. Clarification of the spatial structure of predictors and targets earlier in the manuscript would be helpful. After Sections 1 and 2 it was not clear to me whether the predictors and targets were simply timeseries of snow characteristics at single spatial points or spatiotemporal fields encompassing characteristics within a small-scale simulation region. In Section 3.2 it is noted that regions of study are 1500x1500 pixels, and Figure 3 implies a spatial structure, so it becomes clear eventually! But on a related note, it is not clear whether the timeseries shown in Figure 2 are spatial averages of snow density across the region? Or whether they are timeseries from a single pixel? Or some other method of spatial reduction?

2. Related to 1, clarification on how models handle spatial structure in the data would be helpful. Are predictions made pixelwise, using the timeseries from a single pixel to predict its evolution through time? Or is there some inductive bias (e.g. convolution) in the models exploiting spatial autocorrelation in the data? If there is not, why not? Is this a possible future direction of research?

3. Further description of the physics informed loss function, L_physics, would be helpful. In particular, an explicit equation showing what terms go into the equation would clear things up. Are the MSEs “between the model’s predicted values for snow density, depth, and temperature, and their corresponding values from the SnowPack equations” weighted relative to one another? If so or if not, why? Are the variables preprocessed or normalised in any way to ensure the relative contributions of each variable to the overall loss are sensible?

4. How is the Lambda-weighting set in the overall loss function (Equation 3)? What hyperparameter search method is used and what value is selected?

5. The justification for using spatial LOOCV is strong assuming the end-goal of the emulator is to emulate on unseen geographical sites. But it may be worth explicitly noting this. If the emulator’s goal were to generalise temporally to e.g. make future predictions under climate change, than a time-splitting strategy for train/test might make more sense, but that doesn’t seem to be the end goal here hence spatial LOOCV seems to make sense.

6. Do you have any hypotheses about Region 4 that might’ve led to RF outperforming PG-LSTM? The overall results of PG-LSTM are very strong but sometimes the failure modes are the most interesting for scientific insight!

7. Love the runtime analysis. I am assuming that “the time required to process the data for one region” is the time required for inference, i.e., a forward pass through the trained network? Or does that number also include training? If it does not include training time, it may be worth mentioning how long the networks took to train (and on what hardware) in the first place as that cost must be amortized over the later savings at inference time in practice. Vaguely related: could you map the time savings to a carbon emissions saving? Reducing computational emissions is a particularly strong argument for the expanded use of ML in climate sciences!

8. In general, more detailed figure and table captions would improve the clarity of the paper. There’s plenty of room for this! Figures and tables should ideally be able to stand alone, out of context, and remain understood.

9. I’d appreciate additional methodological detail in Section 4 on feature importance, as I think this section is one of the most scientifically insightful. How did you implement permutation? Did you permute variables within a site, or across sites, or both? Was the “increase in prediction error” on the y-axis of Figure 3 the increase in multi/univariate RMSE, or a different error metric?

Recommendation: Modeling snow on sea ice using physics-guided machine learning — R0/PR3

Comments

This article was accepted into Climate Informatics 2024 Conference after the authors addressed the comments in the reviews provided. It has been accepted for publication in Environmental Data Science on the strength of the Climate Informatics Review Process.

Decision: Modeling snow on sea ice using physics-guided machine learning — R0/PR4

Comments

No accompanying comment.