Hostname: page-component-6766d58669-kn6lq Total loading time: 0 Render date: 2026-05-19T14:14:03.438Z Has data issue: false hasContentIssue false

Applying machine learning methods to avalanche forecasting

Published online by Cambridge University Press:  14 September 2017

A. Pozdnoukhov
Affiliation:
Institute of Geomatics and Analysis of Risk, University of Lausanne, CH-1015 Lausanne, Switzerland E-mail: alexei.pozdnoukhov@unil.ch
R.S. Purves
Affiliation:
Department of Geography, University of Zürich-Irchel, Winterthurerstrasse 190, CH-8057 Zürich, Switzerland
M. Kanevski
Affiliation:
Institute of Geomatics and Analysis of Risk, University of Lausanne, CH-1015 Lausanne, Switzerland E-mail: alexei.pozdnoukhov@unil.ch
Rights & Permissions [Opens in a new window]

Abstract

Avalanche forecasting is a complex process involving the assimilation of multiple data sources to make predictions over varying spatial and temporal resolutions. Numerically assisted forecasting often uses nearest-neighbour methods (NN), which are known to have limitations when dealing with high-dimensional data. We apply support vector machines (SVMs) to a dataset from Lochaber, Scotland, UK, to assess their applicability in avalanche forecasting. SVMs belong to a family of theoretically based techniques from machine learning and are designed to deal with high-dimensional data. Initial experiments showed that SVMs gave results that were comparable with NN for categorical and probabilistic forecasts. Experiments utilizing the ability of SVMs to deal with high dimensionality in producing a spatial forecast show promise, but require further work.

Information

Type
Research Article
Copyright
Copyright © The Author(s) [year] 2008
Figure 0

Fig. 1. Schematic illustration of SVM. The validation data are not used in identifying the decision boundary.

Figure 1

Table 1. The list of features selected by recursive feature elimination algorithm, grouped by type: features related to the current or previous days and expert variables

Figure 2

Fig. 2. SVM training error surface (left) and cross-validation error surface (right). The classification error is a percentage of correctly classified data samples: (Hits + Correct Negative)/(Total Number of Samples).

Figure 3

Table 2. Forecast-verification measures (Doswell and others, 1990; Wilks, 1995)

Figure 4

Fig. 3. (a) Forecast-accuracy and (b) forecast-skill measures. The x axis corresponds to the SVM threshold.

Figure 5

Table 3. Joint distribution of forecasts and observations for binary categorical forecasts and the obtained values for default SVM threshold of 0.5 (values in parentheses are for thresholds of (0.25/ 0.75) respectively)

Figure 6

Fig. 4. The verification of probabilistic output of the SVM with reliability diagram (Wilks, 1995). The x axis indicates the probability value of the decision threshold, and the y axis the empirical probability of the observed avalanches in the days corresponding to the selected threshold. Points close to the black line have the best skill; those closer to the horizontal line have no resolution.

Figure 7

Fig. 5. The prediction of SVM for the validation data of winter 2003/04. The observed events are plotted as black boxes (or continuous series of black boxes) at 0 (no events) and 1 (avalanche activity) levels. The probabilistic output of SVM is plotted as a continuous curve. The x axis corresponds to time in days.

Figure 8

Fig. 6. (a) DEM of the Lochaber region. The locations of avalanche paths are shown with circles. (b) The sample output of the spatio-temporal SVM model, indicating the probability of the event on 20 January 1991. The actual observed events are shown with circles.

Figure 9

Fig. 7. Training error and cross-validation error curves used to identify the optimal number of neighbours.

Figure 10

Table 4. Performance measures for the SVM and NN models computed on the validation dataset of 712 days (winters 2001–07)