Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-09T09:25:40.066Z Has data issue: false hasContentIssue false

Aggregation strategies to improve XAI for geoscience models that use correlated, high-dimensional rasters

Published online by Cambridge University Press:  13 December 2023

Evan Krell*
Affiliation:
Department of Computer Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Innovation in COmputer REsearch Lab (iCORE), Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Hamid Kamangir
Affiliation:
Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Waylon Collins
Affiliation:
National Weather Service, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Scott A. King
Affiliation:
Department of Computer Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA Innovation in COmputer REsearch Lab (iCORE), Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
Philippe Tissot
Affiliation:
Conrad Blucher Institute for Surveying and Science, Texas A&M University - Corpus Christi, Corpus Christi, Texas, USA NSF AI Institute for Research on Trustworthy AI in Weather, Climate and Coastal Oceanography
*
Corresponding author: Evan Krell; Email: ekrell@islander.tamucc.edu

Abstract

Complex machine learning architectures and high-dimensional gridded input data are increasingly used to develop high-performance geoscience models, but model complexity obfuscates their decision-making strategies. Understanding the learned patterns is useful for model improvement or scientific investigation, motivating research in eXplainable artificial intelligence (XAI) methods. XAI methods often struggle to produce meaningful explanations of correlated features. Gridded geospatial data tends to have extensive autocorrelation so it is difficult to obtain meaningful explanations of geoscience models. A recommendation is to group correlated features and explain those groups. This is becoming common when using XAI to explain tabular data. Here, we demonstrate that XAI algorithms are highly sensitive to the choice of how we group raster elements. We demonstrate that reliance on a single partition scheme yields misleading explanations. We propose comparing explanations from multiple grouping schemes to extract more accurate insights from XAI. We argue that each grouping scheme probes the model in a different way so that each asks a different question of the model. By analyzing where the explanations agree and disagree, we can learn information about the scale of the learned features. FogNet, a complex three-dimensional convolutional neural network for coastal fog prediction, is used as a case study for investigating the influence of feature grouping schemes on XAI. Our results demonstrate that careful consideration of how each grouping scheme probes the model is key to extracting insights and avoiding misleading interpretations.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The FogNet input (a) has 384 channels, forming ifve groups of physically-related variables (b).

Figure 1

Figure 2. Various geometric schemes to group 3D rasters. The most granular is the (a) raster itself where no grouping is applied. Each spatial element can be grouped into a (b) pixel that contains all channels at that (row, column) location. Adjacent pixels may be combined into coarser (c) superpixels. Similarly, adjacent (d) channels may be aggregated into (e) channel groups. Within each channel, the elements may be aggregated into (f) channel-wise superpixels (CwSPs).

Figure 2

Figure 3. The hierarchy is defined by the partition tree that is generated by recursively splitting the raster. An example partition tree for a single channel, shown to a depth of 4, is given in (a). The white elements indicate the superpixel at that node. The tree continues until the leaf nodes are single (row, col) elements. Owen values (b) are calculated recursively, where each superpixel is evaluated based on comparisons with the elements in its larger group either present or absent.

Figure 3

Figure 4. PartitionSHAP’s default scheme for building a partition tree. Given a raster (a), the rows and columns are alternatively halved (nearest integer). (b) demonstrates a row split that divides vertically into two groups. This is followed by a column split (c) dividing each horizontally. This process recursively builds a tree where each group is a node whose children are the two groups formed by splitting it.

Figure 4

Figure 5. The form channel-wise SuperPixels (CwPS), the input raster (a) is initially divided along the channels. (b) shows the result of a single channel split, dividing the raster into two halves (or, the nearest integer). When the partitioning reaches a single-channel group, it begins recursively diving along the rows and channels as before. (c) shows the result of row splits performed on all three channels.

Figure 5

Figure 6. SHAP feature effect results for the 5 groups. SHAP values for each group are calculated for each of the 2228 cases and the violin plots represent the distribution of the SHAP value for those cases. (a) lists all 2228 cases, while (b) aggregates based on the outcome.

Figure 6

Figure 7. Top $ {T}_3 $ CwPS spatial aggregates, ranked by absolute SHAP. Red (blue) means pushing towards a fog (non-fog) decision. R/A-R is radiation and advection-radiation fog, and A is advection fog. For reference, an outline of the coastline is shown in (w).

Figure 7

Figure 8. Spatial-channel aggregates of CwPS results are summed along channels to yield 2D explanations. Left of the dotted curve is land and right is water. The star indicates the fog target location (KRAS). Red (blue) means pushing towards a fog (non-fog) decision. R/A-R is radiation and advection-radiation fog, and A is advection fog.

Figure 8

Figure 9. CwPS channel rankings. When summing the superpixel SHAP values, the disproportionate influence of G4 and G5 channels causes channels in other groups to virtually disappear. We instead ranked channels based on the number of times that a channel appears within the top 50 channels.

Figure 9

Figure 10. Feature importance at three granularities. To compare consistency across grouping schemes, the more granular explanations are aggregated into coarser ones. (a) Each column corresponds to the grouping scheme, and each row corresponds to an aggregation granularity. (a) shows the top 15 channels based on PFI performed on CwSP (left-to-right, top-to-bottom). (b and c) Showing individual features. (d and e) Showing the prior row aggregated by group, and e shows methods used on the five groups.

Figure 10

Table 1. Top 15 $ {t}_3 $ channels ranked with channel-wise (Cw) and CwSP schemes