Hostname: page-component-77f85d65b8-grvzd Total loading time: 0 Render date: 2026-03-29T10:23:53.348Z Has data issue: false hasContentIssue false

Cloudy with a chance of precision: satellite’s autoconversion rates forecasting powered by machine learning

Published online by Cambridge University Press:  11 November 2024

Maria Carolina Novitasari*
Affiliation:
Department of Electronic and Electrical Engineering, University College London, London, UK
Johannes Quaas
Affiliation:
Leipzig Institute for Meteorology, Universität Leipzig, Leipzig, Germany ScaDS.AI - Center for Scalable Data Analytics and AI, Leipzig, Germany
Miguel R. D. Rodrigues
Affiliation:
Department of Electronic and Electrical Engineering, University College London, London, UK
*
Corresponding author: Maria Carolina Novitasari; Email: maria.novitasari.20@ucl.ac.uk

Abstract

Precipitation is one of the most relevant weather and climate processes. Its formation rate is sensitive to perturbations such as by the interactions between aerosols, clouds, and precipitation. These interactions constitute one of the biggest uncertainties in determining the radiative forcing of climate change. High-resolution simulations such as the ICOsahedral non-hydrostatic large-eddy model (ICON-LEM) offer valuable insights into these interactions. However, due to exceptionally high computation costs, it can only be employed for a limited period and area. We address this challenge by developing new models powered by emerging machine learning approaches capable of forecasting autoconversion rates—the rate at which small droplets collide and coalesce becoming larger droplets—from satellite observations providing long-term global spatial coverage for more than two decades. In particular, our approach involves two phases: (1) we develop machine learning models which are capable of predicting autoconversion rates by leveraging high-resolution climate model data, (2) we repurpose our best machine learning model to predict autoconversion rates directly from satellite observations. We compare the performance of our machine learning models against simulation data under several different conditions, showing from both visual and statistical inspections that our approaches are able to identify key features of the reference simulation data to a high degree. Additionally, the autoconversion rates obtained from the simulation output and satellite data (predicted) demonstrate statistical concordance. By efficiently predicting this, we advance our comprehension of one of the key processes in precipitation formation, crucial for understanding cloud responses to anthropogenic aerosols and, ultimately, climate change.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Open Practices
Open data
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. The general framework of our proposed method. The left side of the image illustrates the climate science-based procedures we apply to our dataset to generate input-output pairs for training and testing. The center of the image represents our machine learning framework. The right side depicts the satellite observation data we used and the procedure to predict the autoconversion rates from the satellite data. 1 ICOsahedral Non-hydrostatic Large-Eddy Model; 2 Moderate Resolution Imaging Spectroradiometer.

Figure 1

Table 1. Random forest hyperparameters selected in this study

Figure 2

Figure 2. DNN architecture of our machine learning model.

Figure 3

Table 2. Scenarios to test our NN model on satellite data: MODIS data corresponding to ICON-LEM and ICON-NWP simulations with specific time and area

Figure 4

Table 3. Evaluation of the autoconversion rates prediction results on the ICON-LEM simulation model using various machine learning models—linear regression (LR), second-order polynomial regression (PR), random forest (RF), shallow neural network (NN), and deep neural network (DNN)—over Germany testing scenario (scenario 1)

Figure 5

Figure 3. This figure displays five regression plots that compare the predicted and reference simulation autoconversion rates (g m−3h−1) for various machine learning models—(a) linear regression, (b) second-order polynomial regression, (c) random forest, (d) shallow neural network, and (e) deep neural network—on the ICON-LEM Germany testing scenario (scenario 1). The x-axis shows the reference simulation values, while the y-axis shows the predicted autoconversion rates.

Figure 6

Table 4. Evaluation of the autoconversion prediction results on simulation model—ICON-LEM Germany and ICON-NWP Holuhraun—using NN model. Testing set 1 is based on ICON-LEM Germany dataset at 1 km resolution. Testing sets 2 and 3 use Cloud-top ICON-LEM Germany at 1 km resolution and Coarser Cloud-top ICON-LEM Germany, respectively. Testing set 4 uses Cloud-top ICON-NWP Holuhraun at 2.5 km resolution

Figure 7

Figure 4. Visualization of the autoconversion prediction results of ICON-LEM Germany. The left side of the image depicts the reference simulation (simulated), while the middle side shows the prediction results obtained from the shallow NN model. The right side displays the difference between the reference simulation and the prediction results. The top image (a) compares the reference simulation and predictions from ICON-LEM Germany at a resolution of 1 km, depicting each level at different heights in a single 2D map canvas, while the second image (b) focuses on cloud-top information only at a resolution of 1 km. The third figure (c) illustrates the comparison of reference simulation and predictions from ICON-LEM Germany at a coarser resolution of 2.5 km with a focus on cloud-top information only.

Figure 8

Figure 5. Visualization of the autoconversion prediction results of ICON-NWP Holuhraun. The left side of the image depicts the reference simulation (simulated), while the middle side shows the prediction results obtained from the shallow NN model. The right side displays the difference between the reference simulation and the prediction results. These three figures provide a comparison between reference simulation and predictions of the ICON-NWP Holuhraun data with a horizontal resolution of 2.5 km, focusing on cloud-top information only, and featuring different time periods.

Figure 9

Figure 6. Mean, standard deviation, median, 25th, and 75th percentiles of the cloud-top variables of both ICON and MODIS over Germany. Cloud effective radius (CER) is displayed on the left-hand side in subfigures (a) and (c), while autoconversion rate (Aut) is shown on the right-hand side in subfigures (b) and (d).

Figure 10

Figure 7. Comparison of the probability density function of autoconversion rates with different parameterizations.

Figure 11

Figure 8. Mean, standard deviation, median, 25th, and 75th percentiles of the cloud-top variables of both ICON and MODIS over Holuhraun. Cloud effective radius (CER) is displayed on the left-hand side in subfigures (a), (c), (e), (g), (i), (k), and (m), while autoconversion rate (Aut) is shown on the right-hand side in subfigures (b), (d), (f), (h), (j), (l), and (n).

Figure 12

Figure 9. Visual representation of cloud multi-layer flag of MODIS over Germany and Holuhraun.

Figure 13

Figure 10. The probability density function of Cloud-top ICON and MODIS variables over Germany on 2 May 2013 at 1:20 pm: cloud effective radius (CER in μm) and autoconversion rates (Aut in g m−3 h−1).

Figure 14

Figure 11. The probability density function of Cloud-top ICON and MODIS variables over Holuhraun on 3 September 2014 at 10 am: cloud effective radius (CER in μm) and autoconversion rates (Aut in g m−3 h−1).

Supplementary material: File

Novitasari et al. supplementary material

Novitasari et al. supplementary material
Download Novitasari et al. supplementary material(File)
File 209.9 KB