Hostname: page-component-77c78cf97d-sp94z Total loading time: 0 Render date: 2026-05-04T19:42:30.674Z Has data issue: false hasContentIssue false

CAN MACHINE LEARNING CATCH THE COVID-19 RECESSION?

Published online by Cambridge University Press:  23 June 2021

Philippe Goulet Coulombe
Affiliation:
University of Pennsylvania, Philadelphia, Pennsylvania, USA
Massimiliano Marcellino*
Affiliation:
IGIER, Baffi-Carefin, BIDSA and CEPR, Bocconi University, Milan, Italy
Dalibor Stevanović
Affiliation:
Université du Québec à Montréal and CIRANO, Montreal, Quebec, Canada
*
*Corresponding author. Email: massimiliano.marcellino@unibocconi.it
Rights & Permissions [Opens in a new window]

Abstract

Based on evidence gathered from a newly built large macroeconomic dataset (MD) for the UK, labelled UK-MD and comparable to similar datasets for the United States and Canada, it seems the most promising avenue for forecasting during the pandemic is to allow for general forms of nonlinearity by using machine learning (ML) methods. But not all nonlinear ML methods are alike. For instance, some do not allow to extrapolate (like regular trees and forests) and some do (when complemented with linear dynamic components). This and other crucial aspects of ML-based forecasting in unprecedented times are studied in an extensive pseudo-out-of-sample exercise.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© National Institute Economic Review 2021
Figure 0

Table 1. Forecasting models

Figure 1

Table 2. Interpretation of factors estimated from UK-MD, 1998 M1-2020M9

Figure 2

Figure 1. Importance of factorsNote: This figure illustrates the explanatory power of the first nine factors in the UK-MD series organised into nine groups. Group 1: labour market. Group 2: production. Group 3: retail and services. Group 4: consumer and retail price indices. Group 5: international trade. Group 6: money, credit and interest rates. Group 7: stock market. Group 8: sentiment and leading indicators. Group 9: producer price indices.

Figure 3

Figure 2. Number of factors and $ {R}^2 $ over timeNote: This figure plots the number of factors selected recursively since 2009 by the Bai and Ng (2002) $ {PC}_{p2} $ criterion (upper panel) and the corresponding total $ {R}^2 $ (bottom panel).

Figure 4

Figure 3. (Colour online) Best models for four selected targets

Figure 5

Table 3. Best COVID era models (as displayed in figure 3)

Figure 6

Figure 4. (Colour online) Generalised time-varying parameters (GTVPs) of factor-agumented autoregressive random forests (FA-ARRF)(2,2)— employment (EMP) at $ h=1 $Notes: GTVPs of the 1 month ahead EMP forecast. Persistence is defined as the sum of $ {y}_{t-1:2} $’s coefficients. The gray bands are the 68 and 90 per cent credible region. The pale orange region is the ordinary least squares (OLS) coefficient $ \pm $ one standard error. The vertical dotted line is the end of the training sample (for this graph only, not the forecasting exercise itself, which is ever-updating). Pink shading corresponds to recessions.

Figure 7

Figure 5. (Colour online) Generalised time-varying parameters (GTVPs) of autoregressive random forests (ARRF)(6)—RPI HOUSE at $ h=1 $Notes: GTVPs of the 1 month ahead employment (EMP) forecast. Persistence is defined as the sum of $ {y}_{t-1:6} $’s coefficients. The reported intercept is the long-run mean. The gray bands are the 68 and 90 per cent credible region. The pale orange region is the ordinary least squares (OLS) coefficient ± one standard error. The vertical dotted line is the end of the training sample (for this graph only, not the forecasting exercise itself, which is ever-updating). Pink shading corresponds to recessions.

Figure 8

Figure 6. (Colour online) Variable importance for random forests (RF) and boosting— producer price index of manufacturing sector (PPI MANU) at $ h=1 $Note: Comparing variable importance for boosting and RF, with and without moving average rotation of X (MARX), when forecasting PPI MANU at a 1-month horizon.

Figure 9

Table A1. All sample (2008–2020)

Figure 10

Table A2. All sample (2008–2020), continued

Figure 11

Table A3. Restricted sample (2011–2020)

Figure 12

Table A4. Restricted sample (2011–2020), continued

Figure 13

Table A5. Covid sample (from 2020 M1)

Figure 14

Table A6. Covid sample (from 2020 M1), continued

Figure 15

Table A7. Quiet(er) period (2011–2019)

Figure 16

Table A8. Quiet(er) period (2011–2019), continued

Figure 17

Table A9. Pre-Covid (2008–2019)

Figure 18

Table A10. Pre-Covid (2008–2019), continued

Figure 19

Figure B1. (Colour online) Variable importance measures for factor-agumented autoregressive random forests (FA-ARRF)(2,2)—total actual weekly hours worked (HOURS) at $ h=1 $Notes: 20 most important series according to the various variable importance (VI) criteria. Units are relative root mean squre error (RMSE) gains (in percentage) from including the specific predictor in the forest part. $ {\mathrm{VI}}_{OOB} $ means VI for the out-of-bag criterion. $ {\mathrm{VI}}_{OOS} $ is using the hold-out sample. $ {\mathrm{VI}}_{\beta } $ is an out-of-bag measure of how much $ {\beta}_{t,k} $ varies by withdrawing a certain predictor.

Figure 20

Figure B2. (Colour online) Variable importance measures for autoregressive random forests (ARRF)(6)—RPI HOUSE at $ h=1 $Notes: 20 most important series according to the various variable importance (VI) criteria. Units are relative root mean squre error (RMSE) gains (in percentage) from including the specific predictor in the forest part. $ {\mathrm{VI}}_{OOB} $ means VI for the out-of-bag criterion. $ {\mathrm{VI}}_{OOS} $ is using the hold-out sample. $ {\mathrm{VI}}_{\beta } $ is an out-of-bag measure of how much $ {\beta}_{t,k} $ varies by withdrawing a certain predictor.

Figure 21

Figure B3. (Colour online) Full pseudo out of sample forecasts for RPI HOUSING at $ h=1 $Notes: Pink shading corresponds to recessions. Exact selected models are reported in table 3.

Figure 22

Figure 23