Hostname: page-component-5db58dd55d-xnzfm Total loading time: 0 Render date: 2026-06-01T12:41:16.582Z Has data issue: false hasContentIssue false

Using satellite-derived wetness and topographical data to predict the spatial distribution of biota in the McMurdo Dry Valleys, Antarctica

Published online by Cambridge University Press:  12 February 2026

Lars Brabyn*
Affiliation:
School of Arts, University of Waikato , Hamilton, New Zealand
T.G. Allan Green
Affiliation:
School of Science, University of Waikato , Hamilton, New Zealand Botany Department, Faculty of Pharmacy, Universidad Complutense, Madrid, Spain
Ian D. Hogg
Affiliation:
School of Science, University of Waikato , Hamilton, New Zealand Canadian High Arctic Research Station, Polar Knowledge Canada , Cambridge Bay, Nunavut, Canada
Marius Tsui
Affiliation:
School of Arts, University of Waikato , Hamilton, New Zealand
*
Corresponding author: Lars Brabyn; Email: larsb@waikato.ac.nz
Rights & Permissions [Opens in a new window]

Abstract

The McMurdo Dry Valleys in the Ross Sea Region of Antarctica represent a globally unique desert ecosystem where water availability is likely to change under global warming scenarios, thus influencing the distribution and abundance of biota. Using the Random Forest machine learning model, we focused on the spatial distribution of macroscopic terrestrial biota (moss, cyanobacteria, lichen, springtails and mites) in the Dry Valleys. A wetness index, explicitly driven by satellite-derived glacier surface temperatures and meltwater routing, was used along with biological survey data collected over six field seasons (2009–2014) as part of the New Zealand Terrestrial Biocomplexity Survey (n = 886 sites). Our analyses use the full extent of survey data available and include the larger Taylor, Wright and Victoria valleys, as well as data from the previously studied Miers, Marshall and Garwood valleys. The overall model accuracies were mixed (kappa statistic: 0.34% and 17.3% variance explained). However, the resulting predictive maps derived from the model and the influence of the different explanatory variables align with field observations and theoretical expectations. The models show that distance from coast was an important driver for the biota, as well as elevation and temperature. The predictive maps provide an initial model of the distribution of biota in the Dry Valleys and can guide future sampling as well as inform conservation and management strategies. Our research highlights the importance of biological survey data for use in spatial predictive modelling as well as the need to obtain representative samples from a wide range of different habitats (e.g. wet vs dry).

Information

Type
Biological Sciences
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Antarctic Science Ltd
Figure 0

Figure 1. Location of the study area in the Ross Sea region of Antarctic (inset map) and the survey points used in the New Zealand Terrestrial Antarctic Biocomplexity Survey (NZTABS) study. The different valleys surveyed are represented by different colours, as indicated in the figure legend.

Figure 1

Table I. Summary of the New Zealand Terrestrial Antarctic Biocomplexity Survey (NZTABS) field surveys and the number of recorded locations for each survey as well as the number of locations used for the current study.

Figure 2

Table II. Response variables used as well as the number of survey points present or absent for each variable.

Figure 3

Table III. Explanatory variables used for analyses including abbreviations used, data source and original spatial resolution of the data.

Figure 4

Figure 2. Overview of modelling process showing model inputs, model development and outputs of the model. AI = artificial intelligence; NSTABS = New Zealand Terrestrial Antarctic Biocomplexity Survey.

Figure 5

Figure 3. Correlation matrix and data distribution for variables shown along the diagonal of the matrix. Abbreviations for variables correspond to those listed in Table III. The sampling data and distributions for each of the variables used as part of the modelling are named and shown along the diagonal of the figure. Above the diagonal shows the correlation matrix, with the magnitude of the correlation (R values) reflected by font size: larger fonts indicate stronger correlations (both positive and negative). The level of significance for the correlations is represented by the number of asterisks: *P < 0.1, **P < 0.05, ***P < 0.01. Below the diagonal are the individual scatter plots and/or data distribution plots (red lines).

Figure 6

Table IV. Model performance based on the average of 100 different seed values. Negative regression values indicate that the performance of the model is lower than that of a null model.

Figure 7

Figure 4. Examples of spatial prediction maps of biota from the study area in the McMurdo Dry Valleys showing a. cyanobacteria percentage cover, b. springtail presence/absence, c. lichen percentage cover and d. BioIndex 2 presence/absence.

Figure 8

Figure 5. SHapley Additive exPlanations (SHAP) graphs for BioIndex 2 (based on presence/absence), lichens (based on percentage cover), mosses (based on percentage cover), cyanobacteria (based on percentage cover), mites (based on presence/absence) and springtails (Collembola; based on presence/absence). Values to the left of zero indicate a negative association, while those to the right are positive. The feature value colour bar indicates the relative strength of the association from high (yellow) to low (purple). Variables are listed in order of importance relative to each of the six biotic types.