Hostname: page-component-77f85d65b8-zzw9c Total loading time: 0 Render date: 2026-04-19T03:18:48.146Z Has data issue: false hasContentIssue false

A climatic random forest model of agricultural insurance loss for the Northwest United States

Published online by Cambridge University Press:  20 December 2022

Erich Seamon*
Affiliation:
Institute for Modeling, Collaboration, and Innovation, University of Idaho, Moscow 83844, Idaho, USA
Paul E. Gessler
Affiliation:
Department of Forest, Rangeland, and Fire Sciences, University of Idaho, Moscow 83844, Idaho, USA
John T. Abatzoglou
Affiliation:
Department of Management of Complex Systems, University of California-Merced, Merced 95343, California, USA
Philip W. Mote
Affiliation:
The Graduate School, Oregon State University, Corvallis 97331, Oregon, USA
Stephen S. Lee
Affiliation:
Department of Mathematics and Statistical Science, University of Idaho, Moscow 83844, Idaho, USA
*
*Corresponding author. E-mail: erichs@uidaho.edu

Abstract

We compared climatic relationships to insurance loss across the inland Pacific Northwest region of the United States, using a design matrix methodology, to identify optimum temporal windows for climate variables by county in relationship to wheat insurance loss due to drought. The results of our temporal window construction for water availability variables (precipitation, temperature, evapotranspiration, and the Palmer drought severity index [PDSI]) identified spatial patterns across the study area that aligned with regional climate patterns, particularly with regards to drought-prone counties of eastern Washington. Using these optimum time-lagged correlational relationships between insurance loss and individual climate variables, along with commodity pricing, we constructed a regression-based random forest model for insurance loss prediction and evaluation of climatic feature importance. Our cross-validated model results indicated that PDSI was the most important factor in predicting total seasonal wheat/drought insurance loss, with wheat pricing and potential evapotranspiration having noted contributions. Our overall regional model had a $ {R}^2 $ of 0.49, and a RMSE of $30.8 million. Model performance typically underestimated annual losses, with moderate spatial variability in terms of performance between counties.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. 24-county inland Pacific Northwest (iNPW) study area, which includes counties from Washington, Idaho, and Oregon.

Figure 1

Figure 2. Correlation matrices for annual insurance loss due to drought and all four climatic variables: A) maximum temperature, B) potential evapotranspiration, C) precipitation, and D) the Palmer drought severity index for an example county (Whitman County, WA). The x-axis is the number of months of climate data aggregated and the y-axis is the last month of climate data. Each cell represents the correlation between climate data and a county’s annual insurance loss for wheat. For example, July 3 represents the correlation between annual loss and climate data for the months of May, June, and July. Though shown here for just one county, this calculation was performed for each county within the study area, across the 2001–2015 time period. Time windows that had the highest correlations (denoted above with an asterisk) for each county were used in subsequent random forest predictive modeling. A table with all county results can be found in the supplementary materials.

Figure 2

Figure 3. Spatial plot of correlation between insurance loss due to drought, for A) maximum temperature, B) potential evapotranspiration, C) precipitation and D) the Palmer drought severity index, across all counties within the study area, 2001–2015.

Figure 3

Figure 4. Historical versus predicted annual wheat insurance loss ($) due to drought, constructed using a random forest model, for the 24-county iPNW study area. Input variables were precipitation, maximum temperature, and potential evapotranspiration, as well as annual wheat pricing, from 2001 to 2015. Climate variables were refined using the aforementioned time-lagged correlation methodology ($ {R}^2 $ = 0.49).