Hostname: page-component-5db58dd55d-8lnk4 Total loading time: 0 Render date: 2026-06-09T01:32:39.068Z Has data issue: false hasContentIssue false

Evaluating CMIP6 models for near-surface air temperature projections over Antarctic ice shelves

Published online by Cambridge University Press:  02 January 2026

Maya Fields*
Affiliation:
Department of Climate and Space Sciences and Engineering, University of Michigan, Ann Arbor, MI, USA
Jeremy Bassis
Affiliation:
Department of Climate and Space Sciences and Engineering, University of Michigan, Ann Arbor, MI, USA
Samuel B Kachuck
Affiliation:
Department of Climate and Space Sciences and Engineering, University of Michigan, Ann Arbor, MI, USA
Mark Moldwin
Affiliation:
Department of Climate and Space Sciences and Engineering, University of Michigan, Ann Arbor, MI, USA
Cécile Agosta
Affiliation:
Laboratoire de Sciences du Climat et de l’Environnement, LSCE-IPSL, CEA, CNRS, UVSQ, UMR8212, Université Paris Saclay, Gif-sur-Yvette, France
*
Corresponding author: Maya Fields; Email: mafields@umich.edu
Rights & Permissions [Opens in a new window]

Abstract

The meltwater-driven disintegration of the Larsen B ice shelf has raised concerns that other ice shelves may face similar vulnerabilities as global temperatures rise. Climate projections show increased ice shelf vulnerability to surface melt in the coming century, yet the ability of large-scale climate models to simulate temperatures over ice shelves—a key factor in these projections—has rarely been assessed. We address this gap by using ERA5 reanalysis data to evaluate 31 CMIP6 models’ performance in simulating near-surface air temperatures over 46 Antarctic ice shelves from 1979 to 2014. We find that CMIP6 models exhibit annual and summer warm biases over most ice shelves. There is also inter-model variability of up to 13°C between model temperatures over the Amery and Riiser-Larsen shelves for both annual and summer periods. Significant regional differences are present: shelves in the Amundsen Sea Embayment show cold biases, while those in the Weddell Sea show warm biases. While topography corrections can reduce some biases, we find notable seasonal differences, including biases with opposite signs between annual and summer means. Our results underscore the importance of careful model selection by shelf and region to improve the reliability of future climate projections and assessments of Antarctic ice shelf vulnerability.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of International Glaciological Society.
Figure 0

Figure 1. Multi-model mean near-surface air temperature, defined as the CMIP6 model near-surface air temperature minus ERA5’s near-surface air temperature, for 46 Antarctic ice shelves averaged over the period 1979-2014. (a) Annual multi-model mean near-surface air temperature bias. (b) Austral summer (DJF) multi-model mean near-surface air temperature bias. Colors indicate the degree of temperature over- or underestimation with respect to ERA5 on each ice shelf.

Figure 1

Figure 2. Box-and-whisker plots illustrating the distribution of near-surface air temperature biases for 31 CMIP6 models, relative to ERA5, spatially averaged over 46 ice shelves (with Ross divided into Ross East and Ross West) over the period 1979–2014. Panel (a) shows annual near-surface air temperature biases from CMIP6 models (shown in blue), while panel (b) displays austral summer (DJF) biases (shown in Orange) for each ice shelf. Each box represents the interquartile range (IQR), with the CMIP6 multi-model mean indicated by a horizontal line inside the box, and whiskers extending to the most extreme values within 1.5 × IQR. Filled boxes and whiskers indicate shelves where the CMIP6 multi-model mean bias is statistically significant compared to ERA5 (Mann–Whitney U test); unfilled boxes and whiskers indicate shelves where the bias is small or not significant. The vertical dashed line denotes zero bias in the multi-model mean. The red boxes indicate the ice shelves we will explore in more detail in Section 4.3.

Figure 2

Figure 3. Box-and-whisker plots showing the distribution of near-surface air temperature biases for 32 models, including 31 CMIP6 models and the multi-model mean, relative to ERA5, spatially averaged across all Antarctic ice shelves and averaged over the period 1979–2014. The model points have been jittered from a single line to visualize each of the different models within the box-and-whisker plots. The left panel displays annual (blue) mean near-surface air temperature biases, while the right panel shows austral summer (DJF) (Orange) biases. Each point represents an individual model, and the multi-model mean is represented by a white X. Boxes indicate the interquartile range (IQR), the median is marked by a horizontal line within each box, and whiskers extend to the most extreme values within 1.5 × IQR.

Figure 3

Figure 4. Pairwise comparison of near-surface air temperature biases for individual CMIP6 models (and the multi-model mean) with respect to ERA5, averaged across all Antarctic ice shelves and weighted by ice shelf area for the period 1979–2014. For each model, mean annual bias (blue triangles) and austral summer (DJF) bias (Orange circles) are shown, with lines connecting the two for each model to illustrate consistency or changes in seasonal bias. Filled symbols indicate models for which the bias is statistically significant compared to ERA5 (using the Mann–Whitney U test), while open symbols denote comparably small or non-significant biases. The vertical dashed line represents zero mean bias across all shelves.

Figure 4

Figure 5. Box-and-whisker plots showing the distribution of near-surface air temperature biases for the annual (blue) and austral summer (Orange) averaged over the period 1979–2014 for 32 models, including 31 CMIP6 models and the multi-model mean, relative to ERA5, over the Amery, Larsen B, and Riiser-Larsen ice shelves. The model points have been jittered from a single line to visualize each of the different models within the box-and-whisker plots. Each point represents an individual model, and the multi-model mean is represented by a white X. Boxes indicate the interquartile range (IQR). The median is marked by a horizontal line within each box, and whiskers extend to the most extreme values within 1.5 × IQR.

Figure 5

Figure 6. Pairwise comparison of near-surface air temperature biases for individual CMIP6 models (and the multi-model mean) with respect to ERA5, averaged over the period 1979–2014 for the a) Amery, b) Larsen B, and c) Riiser-Larsen ice shelves. For each model, mean annual bias (blue triangles) and austral summer (DJF) bias (Orange circles) are shown, with lines connecting the two for each model to illustrate consistency or changes in seasonal bias. Filled symbols indicate models for which the bias is statistically significant compared to ERA5 (using the Mann–Whitney U test), while open symbols denote comparably small or non-significant biases. The vertical dashed line represents zero mean bias.

Figure 6

Figure 7. Scatter plot showing the interannual variability and bias of near-surface air temperature for CMIP6 models, averaged across all ice shelves for the period 1979–2014. The x-axis represents model interannual variability, quantified as the standard deviation of temperature, while the y-axis shows the mean annual near-surface air temperature bias relative to ERA5, averaged across all ice shelves. Each blue circle with a black number corresponds to a CMIP6 model whose variance is not statistically different from ERA5 according to the Fligner-Killeen test. In contrast, each Orange circle indicates a model with a statistically significant difference in variance. The black numbers within the circles refer to the model numbers listed in the legend to the right of the plot. The standard deviation of ERA5’s annual near-surface air temperature, averaged across all ice shelves, is indicated by a red star. For reference, two dashed red lines are included: one horizontal line at zero on the y-axis, indicating ERA5’s zero bias with respect to itself, and one vertical line at the ERA5 standard deviation on the x-axis, facilitating a direct comparison of model variability with ERA5.

Figure 7

Figure 8. The scatter plot depicts the relationship between changes in surface orography and annual near-surface air temperature biases for 9716 grid cells (0.25° × 0.25° resolution) across 46 Antarctic ice shelves, comparing (a) the CMIP6 model NorESM2-LM to ERA5 reanalysis data and (b) the CMIP6 model MRI-ESM2-0 to ERA5 reanalysis data. The x-axis represents the change in surface orography (in meters), which is defined as the CMIP6 model surface orography minus ERA5’s surface orography, while the y-axis shows the annual near-surface air temperature bias defined as the CMIP6 model temperature minus ERA5’s temperature. The scatter plot points are colored according to the land ice area fraction of the CMIP6 model, (a) NorESM2-LM and (b) MRI-ESM2-0, with lighter points representing a higher percentage of land ice in a given grid cell. The histogram on the right side of Figs. 8a and 8b illustrates the density of points for the annual near-surface air temperature bias for the CMIP6 model, whereas the histogram on the top of Figs. 8a and 8b displays the density of points for the change in surface orography for the CMIP6 model.

Figure 8

Figure 9. ERA5 mean annual near-surface air temperature compared to the multi-model mean near-surface air temperature for 46 Antarctic ice shelves using 8 out of the 31 CMIP6 (Nor-ESM2-LM, MRI-ESM2-0, E3SM-1-0, E3SM-1-1, E3SM-1-1-ECA, MIROC6, INM-CM5-0, and INM-CM4-8) models averaged over the period 1979-2014 before and after applying a lapse rate orographic correction. (a) The original multi-model mean near-surface temperature for the 8 CMIP6 models. (b) The annual mean near-surface air temperature bias for the original multi-model mean of the 8 CMIP6 models. Colors indicate the degree of temperature over- or underestimation on each ice shelf compared to ERA5. (c) The annual mean near-surface air temperature bias for the orography-corrected multi-model mean of the 8 CMIP6 models. Colors indicate the degree of temperature over- or underestimation on each ice shelf compared to ERA5. (d) The orography corrected multi-model mean near-surface temperature for the 8 CMIP6 models. (e) The temperature difference between the original multi-model mean and the orography corrected multi-model mean near-surface air temperature, defined as the original multi-model mean minus the orography corrected multi-model mean near-surface temperature, for the 8 CMIP6 models.

Figure 9

Table A1. List of Coupled Model Intercomparison Project Phase Six (CMIP6) models used in our study, including each model’s name, country of origin, native horizontal resolution (longitude and latitude), historical timespan analyzed, specific variant label of the CMIP6 data accessed, and the corresponding reference for the historical model data utilized.

Supplementary material: File

Fields et al. supplementary material 1

Fields et al. supplementary material
Download Fields et al. supplementary material 1(File)
File 7.7 KB
Supplementary material: File

Fields et al. supplementary material 2

Fields et al. supplementary material
Download Fields et al. supplementary material 2(File)
File 9.2 KB
Supplementary material: File

Fields et al. supplementary material 3

Fields et al. supplementary material
Download Fields et al. supplementary material 3(File)
File 2 MB