Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-09T22:27:46.344Z Has data issue: false hasContentIssue false

A physics-informed machine learning parameterization for cloud microphysics in ICON

Published online by Cambridge University Press:  27 August 2025

Ellen Sarauer*
Affiliation:
Deutsches Zentrum für Luft und Raumfahrt e.V., Institut für Physik der Atmosphäre , Oberpfaffenhofen, Germany
Mierk Schwabe
Affiliation:
Deutsches Zentrum für Luft und Raumfahrt e.V., Institut für Physik der Atmosphäre , Oberpfaffenhofen, Germany
Philipp Weiss
Affiliation:
University of Oxford, Department of Physics, Atmospheric, Oceanic and Planetary Physics, Oxford, UK
Axel Lauer
Affiliation:
Deutsches Zentrum für Luft und Raumfahrt e.V., Institut für Physik der Atmosphäre , Oberpfaffenhofen, Germany
Philip Stier
Affiliation:
University of Oxford, Department of Physics, Atmospheric, Oceanic and Planetary Physics, Oxford, UK
Veronika Eyring
Affiliation:
Deutsches Zentrum für Luft und Raumfahrt e.V., Institut für Physik der Atmosphäre , Oberpfaffenhofen, Germany University of Bremen, Institute of Environmental Physics , Bremen, Germany
*
Corresponding author: Ellen Sarauer; Email: ellen.sarauer@dlr.de

Abstract

We developed a cloud microphysics parameterization for the icosahedral nonhydrostatic modeling framework (ICON) model based on physics-informed machine learning (ML). By training our ML model on high-resolution simulation data, we enhance the representation of cloud microphysics in Earth system models (ESMs) compared to traditional parameterization schemes, in particular by considering the influence of high-resolution dynamics that are not resolved in coarse ESMs. We run a global, kilometer-scale ICON simulation with a one-moment cloud microphysics scheme, the complex graupel scheme, to generate 12 days of training data. Our ML approach combines a microphysics trigger classifier and a regression model. The microphysics trigger classifier identifies the grid cells where changes due to the cloud microphysical parameterization are expected. In those, the workflow continues by calling the regression model and additionally includes physical constraints for mass positivity and water mass conservation to ensure physical consistency. The microphysics trigger classifier achieves an F1 score of 0.93 on classifying unseen grid cells. The regression model reaches an $ {R}^2 $ score of 0.72 averaged over all seven microphysical tendencies on simulated days used for validation only. This results in a combined offline performance of 0.78. Using explainability techniques, we explored the correlations between input and output features, finding a strong alignment with the graupel scheme and, hence, physical understanding of cloud microphysical processes. This parameterization provides the foundation to advance the representation of cloud microphysical processes in climate models with ML, leading to more accurate climate projections and improved comprehension of the Earth’s climate system.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open materials
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Map of mmrs. Obtained from the simulation on model level 70, corresponding to a height of about 3 km. The upper figure shows water vapor (red), cloud water (blue), and cloud ice (green). The lower figure shows rain (red), snow (blue), and graupel (green). Earth illustration from NASA Visible Earth (2024).

Figure 1

Figure 2. Overview of the presented ML parameterization pipeline. Left, the input variables. Right, the output variables are listed, and for more details, see Table A1. In the center, step 1 illustrates the classifier MLP, and step 2 illustrates the regression MLP.

Figure 2

Figure 3. Classification performance. The left image shows the confusion matrix, comparing predicted and true classes. The color intensity represents the ratio of samples in each class to the total number for that class. The right image shows the Shapley analysis. The input variables are listed in importance for the respective output variable from top to bottom. For each feature, red represents large values of a variable, and blue represents small ones. The x-axis shows the Shapley values.

Figure 3

Table 1. Fit performance measures for the regression output in the ML framework

Figure 4

Figure 4. Regression performance. The left image shows the ML predictions of the regression model versus ground truth of the tendency of cloud ice mmr. The colors illustrate the density of the data on a logarithmic scale. The right image shows the Shapley analysis for the tendency of cloud ice mmr. The input variables are listed in importance for the respective output variable from top to bottom. For each feature, red represents large values of a variable, and blue represents small ones. The x-axis shows the Shapley values.

Figure 5

Figure 5. ML prediction (left) of the tendency of graupel mmr averaged over 3 validation days (24 timesteps in total) compared with the ground truth (center) and a coarse reference simulation (right) for a random day within the validation period. Colors represent the magnitude of the mmr tendency, averaged over longitude. The x-axis is latitude, the y-axis is height, and white areas indicate no change due to microphysics.

Figure 6

Figure A1. Pearson autocorrelation matrix for all raw input and output parameters. For an explanation of the variable short names, the reader is referred to Table A1. The color bar ranges from negative correlation (blue) over no correlation (white) to positive correlation (red).

Figure 7

Figure A2. Regression performance. The left images show the ML predictions of the regression model versus ground truth of the tendency of temperature, water vapor mmr, and cloud water mmr. The colors illustrate the density of the data on a logarithmic scale. The right images show the Shapley analysis for the tendency of temperature, water vapor mmr, and cloud water mmr. The input variables are listed in importance for the respective output variable from top to bottom. For each feature, red represents large values of a variable, and blue represents small ones. The x-axis shows the Shapley values.

Figure 8

Figure A3. Regression performance. The left images show the ML predictions of the regression model versus ground truth of the tendency of rain mmr, snow mmr, and graupel mmr. The colors illustrate the density of the data on a logarithmic scale. The right images show the Shapley analysis for the tendency of rain mmr, snow mmr, and graupel mmr. The input variables are listed in importance for the respective output variable from top to bottom. For each feature, red represents large values of a variable, and blue represents small ones. The x-axis shows the Shapley values.

Figure 9

Figure A4. ML prediction (left) of the tendency of temperature, water vapor, and cloud water mmr averaged over 3 validation days (24 timesteps in total) compared with the ground truth (center) and a coarse reference simulation (right) for a random day within the validation period. Colors represent the magnitude of the mmr tendency, averaged over longitude. The x-axis is latitude, the y-axis is height, and white areas indicate no change due to microphysics.

Figure 10

Figure A5. ML prediction (left) of the tendency of cloud ice, rain, and snow mmr averaged over 3 validation days (24 timesteps in total) compared with the ground truth (center) and a coarse reference simulation (right) for a random day within the validation period. Colors represent the magnitude of the mmr tendency, averaged over longitude. The x-axis is latitude, the y-axis is height, and white areas indicate no change due to microphysics.

Figure 11

Figure A6. Histograms of the microphysical tendencies averaged over 3 validation days (24 timesteps in total) for the high-resolution ground truth (gray), coarse-resolution simulation (green), and ML predictions (red). The histograms are standardized, and the number of entries is shown on a logarithmic scale to improve readability.

Figure 12

Table A1. Overview of all considered input and output features of the MLP model

Author comment: A physics-informed machine learning parameterization for cloud microphysics in ICON — R0/PR1

Comments

From:

Ellen Sarauer (German Aerospace Center)

To:

Joachim Denzler (Friedrich Schiller University Jena)

Patrick Emami (National Renewable Energy Laboratory)

Emily Gordon (Stanford University)

Panayiotis Moutis (City College New York)

Zoltan Nagy (University of Texas Austin)

Tejasri Nampally (Climate Change AI)

Mark Roth (Climate Change AI)

Environmental Data Science Journal

Cambridge University Press

02.09.2024

Dear Environmental Data Science Editorial Team,

we are pleased to submit our paper: “A physics-informed machine learning parameterization for cloud microphysics in ICON” to be considered for publication in the Environmental Data Science journal as part of the special collection “Tackling Climate Change with Machine Learning”.

We propose an application paper within the scope of improving climate modelling focusing on the atmosphere with machine learning. With our work, we aim to enhance the accuracy of climate models and deepen our understanding of subgrid-scale cloud processes. Therefore, our paper presents a new, data-driven cloud microphysics parameterization. The research presented in this paper builds upon our previous work, which we initially presented at the ICLR 2024 CCAI workshop, and has been significantly extended in the last months.

We employ advanced machine learning methods and achieve promising results by training and evaluating on high-resolution Icosahedral Non-hydrostatic modeling framework (ICON) simulations. Therefore, we improve cloud microphysical process representations by particularly taking into account the influence of highly resolved dynamics on cloud microphysical processes. We use explainability techniques to explore the correlations between input and output features in each step of our architecture, finding a strong alignment with physical understanding of cloud microphysical processes. Additionally, we use physics-constrained loss functions to ensure for water mass conservation and mass positivity. These methods will contribute to a better understanding of climate-relevant processes, once coupled with other ML-based parameteriztions to advanced hybrid Earth system models. Our work is relevant in the scope of climate change, because more accurate and robust climate projections are essential for informed decision-making and risk assessment in addressing the global warming crisis.

The article “A physics-informed machine learning parameterization for cloud microphysics in ICON” has not been published in another journal and reflects original research conducted by the authors. None of the authors have any competing interest concerning this paper.

We believe our paper would be a strong fit for your journal and would contribute valuable insights to the special collection.

Thank you for considering our submission. We look forward to your feedback.

Sincerely,

Ellen Sarauer

Institut für Physik der Atmosphäre

Deutsches Zentrum für Luft- und Raumfahrt (German Aerospace Center)

E-Mail: ellen.sarauer@dlr.de

Review: A physics-informed machine learning parameterization for cloud microphysics in ICON — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

*Summary*

This paper introduces a machine learning based cloud microphysics parameterization for the ICON model. It’s trained on 12 days of simulation data from a global, kilometer-scale ICON simulation with a one-moment microphysics scheme (complex graupel scheme). Using a two-stage classifier-regression setup, they achieve a F1 score of .93 on classifying unseen grid cells and an average R squared of .72 for all seven microphysical tendencies.

This model also incorporates physical constraints that ensure water mass conservation for the microphysical hydrometeors and sets (predicted) negative masses to zero. For interpretability, the authors make use of Shapley values for both the classifier and the regressor to confirm that input features that contribute most to model predictions align with physics-based expectations.

*Judgment for overall quality and suitability for EDS*

The paper introduces a well-thought out approach to machine learning based cloud microphysics parameterization. It is suitable for publication in Environmental Data Science with minor revisions.

*Evaluation for whether data is technically correct and scientifically sound*

The paper is technically and scientifically sound, although I am not sure using a weighted average of the F1 score and the R squared is the best way to benchmark the overall offline performance of the parameterization. Ideally, a reproducible benchmark should allow for easy intercomparison of alternate approaches that make use of a unified regression model (where negative classifications are mapped to zero).

*Overall assessment of whether the paper is written clearly and whether length is appropriate*

The paper is written very clearly, and the associated github repo is clean, well-documented, and exemplary for future work.

*General suggestions for improving the paper*

It would be nice to include online results, but it’s also perfectly understandable to save this for a follow-up paper given the technical complexity involved.

Using a different metric than a weighted average of the F1 score and R squared is probably ideal as it is highly dependent on fraction of active versus passive cells, which could change in future work.

*Detailed suggestions for improving the paper*

Instead of using a weighted average of the F1 score and R squared, perhaps an alternative approach would be to treat classification of negative cases as regression mapped to zero and show a unified R squared score. This unified R squared score should also take into account samples where the regressor was incorrectly activated.

“After coarse-graining, we select a random subset within the first 9 days after spin-up of our simulation to use for training and testing. We validate our method on the 3 remaining days.” Did you mean to say 9 days are used for training and validation, and the 3 remaining days are used for testing? This would make more sense.

Review: A physics-informed machine learning parameterization for cloud microphysics in ICON — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

Overall, this study presents an interesting approach to advancing cloud microphysical representation in climate models using machine learning based-parameterizations. The writing is mostly clear, and the subject matter is appropriate for Environmental Data Science. The authors build on recent work in the field by investigating how a convective-permitting model with a single-moment microphysics scheme that includes graupel at high resolution can be emulated at the coarser scale of an Earth System Model (ESM). This approach could potentially improve the consistency of cloud representation across scales. However, methodologically there are some significant limitations to this approach which are not really addressed or discussed in detail in the current draft. These methodological choices should be more fully justified and explained before the paper is ready for publication.

The major limitation of this study is that emulating a biased model (the one moment cloud microphysics scheme) at higher resolution has less clear benefits than emulating the sub-grid-scale parameterizations for processes such as deep convection where going to the higher resolution model removes the need to parameterize these processes. For this reason, previous studies on ML-based parameterizations for cloud microphysics have focused on improving individual microphysical process rates such as collision coalescence by emulating higher complexity microphysical models, which is more clearly supported by the physical basis of these process rates.

Conceptually there’s an assumption that higher resolution will improve process level representation, but for microphysics it’s not that clear how this would be the case. Moreover, what is learned by the machine learning model at the coarser scales will not solely be due to the impact of microphysics, but will instead be a combination of both larger scale cloud processes that are not resolved at the coarse scale and cloud microphysical processes. Microphysical process complexity and model resolution are separate issues that complicate the interpretation and emulation of the sub-grid-scale tendencies for cloud microphysics.

The other limitation of the study is that the focus here is on off-line performance only, which leaves questions about how well this model will perform when actually integrated online into an ESM and run over multiple time-steps. This will evidently be the focus of future work, but it does mean the current study is relatively limited in terms of its assessment of this method.

General Comments:

Why not use a more advanced two moment microphysics scheme? Presumably this is due to computational limitations of the convective permitting model, but it should be made clearer that single moment microphysics schemes have known limitations and biases. It would be useful to provide more justifications for the choice of the microphysics scheme in the paper.

The coarse-graining approach seems potentially problematic for cloud microphysics due to the issues discussed above. Is the coarse graining done on both the mass mixing ratios and the tendencies? Is cloud fraction included in some way?

Is the temperature tendency predicted by the model only due to temperature changes related to microphysical processes? This should be made clearer if so.

It would be useful to discuss (or show) the distributions of the microphysical tendencies in the training and test data sets, as well as in the coarse and fine scale simulations.

Did you try training the classification and regression tasks end to end?

Specific Comments:

P. 4, lines 13-14. Are the instantaneous tendencies for the microphysical processes saved at the 3 hourly scale?

P. 5. Line 42. There is a typo in “especially”.

Figure 4. The x and y axis labels on the left panel of this figure are confusing and appear to be non-monotonic—can you check whether there’s a mistake with the labels here? This also appears to be a problem in Figures 7 and 8.

Lines 52 on P. 8 mentions experiments with various scaling methods and it would be useful to discuss this in more detail. Are inputs scaled using a log transformation? Was any scaling of the inputs investigated here?

P. 10 Lines 36 – 38. Does the coarse-grained model use the one moment microphysics scheme that includes graupel?

Recommendation: A physics-informed machine learning parameterization for cloud microphysics in ICON — R0/PR4

Comments

I have received two reviewer reports for ”A physics-informed machine learning parameterization for cloud microphysics in ICON". From these reports, major revisions are required before publication of this manuscript. I am therefore returning the manuscript to you so you may make the changes suggested by the reviewers.

Decision: A physics-informed machine learning parameterization for cloud microphysics in ICON — R0/PR5

Comments

No accompanying comment.

Author comment: A physics-informed machine learning parameterization for cloud microphysics in ICON — R1/PR6

Comments

Dear Editor Dr. Gordon,

We would like to thank you and the reviewers for the time spent on reviewing our

manuscript, and for the valuable comments which help improving the quality of our

manuscript. We have carefully addressed the reviewers’ comments.

A summary of main modifications and a detailed point-by-point response to your com-

ments and to the comments from Reviewer #1 and Reviewer #2 are given below. Please

find enclosed the revised version of our manuscript “A physics-informed machine learn-

ing parameterization for cloud microphysics in ICON” (EDS-2024-0047), with changes

marked in red. We hope that the manuscript can now be accepted for publication.

Sincerely,

On behalf of all authors,

Ellen Sarauer

Review: A physics-informed machine learning parameterization for cloud microphysics in ICON — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

The authors of this paper have addressed all reasonable concerns well, and I have no further comments. I look forward to seeing online results in future work, as this is a critical test for the effectiveness of machine learning parameterizations.

Review: A physics-informed machine learning parameterization for cloud microphysics in ICON — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

The authors have expanded their discussion to include more details on the limitations of their particular approach. While I still believe there are significant conceptual limitations to this approach for parameterizing microphysics at the sub-grid-scale, I do recognize that this current research provides a step forward, and thus recommend that it be published. I have a couple of additional questions:

1.) Since the dynamics are now in some sense incorporated with the microphysics, did you explore including additional inputs into the tendency prediction, such as the vertical updraft speed?

2.) When looking at the distributions of the microphysical tendencies predicted by the ML method and the coarse-grained version, it looks like the ML model under-predicts the extremes of ice but over-predicts the extremes of cloud water. This could lead to some significant biases when implementing this ML model online, and it would be good to address this limitation in the text.

Recommendation: A physics-informed machine learning parameterization for cloud microphysics in ICON — R1/PR9

Comments

No accompanying comment.

Decision: A physics-informed machine learning parameterization for cloud microphysics in ICON — R1/PR10

Comments

No accompanying comment.

Author comment: A physics-informed machine learning parameterization for cloud microphysics in ICON — R2/PR11

Comments

Dear Editor Dr. Gordon,

We would like to thank you and the reviewers again for the time spent on reviewing our manuscript, and for the valuable comments which help improving the quality of our manuscript. We have carefully addressed the reviewers' comments.

In this work, we present a cloud microphysics parameterization based on physics-informed machine learning (ML) for the Icosahedral Non-hydrostatic (ICON) model. Our findings suggest improvements in the representation of cloud microphysics in Earth System Models (ESMs), particularly by incorporating high-resolution dynamics that are typically unresolved in coarse ESMs. This approach offers a promising pathway for advancing climate projections and enhancing our understanding of cloud microphysical processes.

Sincerely,

On behalf of all authors,

Ellen Sarauer

Review: A physics-informed machine learning parameterization for cloud microphysics in ICON — R2/PR12

Conflict of interest statement

Reviewer declares none.

Comments

The authors have addressed my additional comments and in my opinion the manuscript is ready to publish.

Recommendation: A physics-informed machine learning parameterization for cloud microphysics in ICON — R2/PR13

Comments

No accompanying comment.

Decision: A physics-informed machine learning parameterization for cloud microphysics in ICON — R2/PR14

Comments

No accompanying comment.