Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-06T23:48:08.205Z Has data issue: false hasContentIssue false

Cyber risk modeling within the SIR epidemic framework: a comparative analysis of frequency and severity methods

Published online by Cambridge University Press:  26 January 2026

Rong He
Affiliation:
Centre for Actuarial Studies, Department of Economics, The University of Melbourne, Melbourne, VIC, Australia
Zhuo Jin*
Affiliation:
Department of Actuarial Studies and Business Analytics, Macquarie University, Sydney, NSW, Australia
Shuanming Li
Affiliation:
Centre for Actuarial Studies, Department of Economics, The University of Melbourne, Melbourne, VIC, Australia
David Pitt
Affiliation:
Centre for Actuarial Studies, Department of Economics, The University of Melbourne, Melbourne, VIC, Australia
*
Corresponding author: Zhuo Jin; Email: zhuo.jin@mq.edu.au
Rights & Permissions [Opens in a new window]

Abstract

This paper addresses the gap between theoretical modeling of cyber risk propagation and empirical analysis of loss characteristics by introducing a novel approach that integrates both approaches. We model the development of cyber loss counts over time using a discrete-time susceptible-infected-recovered process, linking these counts to covariates, and modeling loss severity with regression models. By incorporating temporal and covariate-dependent transition rates, we eliminate the scaling effect of population size on infection counts, revealing the true underlying dynamics. Simulations show that this susceptible-infected-recovered framework significantly improves aggregate loss prediction accuracy, providing a more effective and practical tool for actuarial assessments and risk management in the cyber risk context.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of The Institute and Faculty of Actuaries
Figure 0

Table 1. Dataset feature names

Figure 1

Figure 1. Histograms of new infection and recovery counts per year.

Figure 2

Figure 2. Boxplot of new infection counts by region (top left), risk category (top right), industry (bottom left), and company size according to number of employees (bottom right).

Figure 3

Figure 3. Boxplot of log(loss) by the first year of incident occurrence.

Figure 4

Figure 4. Boxplot of loss severity by region (top left), risk category (top right), industry (bottom left), and company size according to number of employees (bottom right).

Figure 5

Table 2. Goodness-of-fit analysis – counts

Figure 6

Figure 5. Empirical and theoretical CDFs of annual infection (left) and recovery (right) counts.

Figure 7

Table 3. Goodness-of-fit analysis – severity

Figure 8

Figure 6. The empirical and estimated CDFs of loss severity.

Figure 9

Table 4. Regression results for the negative binomial GLM link functions

Figure 10

Figure 7. The number of new infection (left) and recovery (right) counts per year, predicted from Lasso-penalized GLMs with firm- and incident-specific features and $\mathbf{T}$.

Figure 11

Figure 8. The shape of transition rates.

Figure 12

Table 5. Regression results for the truncated lognormal GLM link functions

Figure 13

Figure 9. The number of infected and recovered individuals, fitted and predicted with constant model parameters.

Figure 14

Table 6. Goodness-of-fit analysis – transition rates

Figure 15

Figure 10. Empirical and theoretical CDFs of transmission (left) and recovery (right) rates.

Figure 16

Figure 11. The number of new infection (left) and recovery (right) counts per year with time-varying and age-dependent rates predicted from Lasso-regularized CLS.

Figure 17

Table 7. Regression results for the zero-inflated beta GAMLSS link functions

Figure 18

Figure 12. Transmission (left) and recovery (right) rates predicted from Lasso-regularized GAMLSS model.

Figure 19

Figure 13. Predicted number of new infection (left) and recovery (right) counts per year with Lasso-regularized GAMLSS rates.

Figure 20

Figure 14. Diagnostic plots for the penalized recovery rate model.

Figure 21

Table 8. Regression results for the zero-inflated beta GAMLSS link functions with $t$ as the only covariate

Figure 22

Figure 15. Transition rates assuming time as sole predictor.

Figure 23

Figure 16. Predicted number of new infection (left) and recovery (right) counts per year, assuming transition rates depend solely on time.

Figure 24

Figure 17. Sensitivity of estimated transmission rates to population size $N$. Bottom row: GAMLSS with dependence on $t$ only. Top row: GAMLSS with covariate dependence (Lasso-penalized).

Figure 25

Table 9. Summary statistics of empirical losses (in millions of USD), 2017–2021

Figure 26

Table 10. Simulated risk measurements (in millions of USD) in 2017–2021, assuming different frequency and severity models

Figure 27

Figure 18. Simulated aggregate losses assuming zero-inflated beta GAMLSS transmission rate and lognormal GLM severity, both include time covariate only. Top panel shows the complete histograms, while the bottom panel shows the body only.

Figure 28

Table D.1. Frequency distribution parameters

Figure 29

Table D.2. Severity distribution parameters

Figure 30

Table D.3. Rate distribution parameters

Figure 31

Table E.1. Transmission rate regression results for the zero-inflated beta GAMLSS link functions with $t$ as the only covariate, under varying $N$

Figure 32

Table E.2. Regression results for the zero-inflated beta GAMLSS link functions, under varying $N$

Figure 33

Figure E.1. Counts with GAMLSS transmission rates (dependent on $t$ only) under different $N$.

Figure 34

Figure E.2. Counts with GAMLSS transmission rates (covariate-dependent and lasso penalized) under different $N$.

Supplementary material: File

He et al. supplementary material

He et al. supplementary material
Download He et al. supplementary material(File)
File 237.9 KB