Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-04-18T01:31:30.937Z Has data issue: false hasContentIssue false

Machine learning and feature selection: Applications in economics and climate change

Published online by Cambridge University Press:  15 December 2023

Berkay Akyapı*
Affiliation:
Department of Information Systems and Operations Management, University of Florida, Gainesville, FL, USA

Abstract

Feature selection is an important component of machine learning for researchers that are confronted with high dimensional data. In the field of economics, researchers are often faced with high dimensional data, particularly in the studies that aim to understand the channels through which climate change affects the welfare of countries. This work reviews the current literature that introduces various feature selection algorithms that may be useful for applications in this area of study. The article first outlines the specific problems that researchers face in understanding the effects of climate change on countries’ macroeconomic outcomes, and then provides a discussion regarding different categories of feature selection. Emphasis is placed on two main feature selection algorithms: Least Absolute Shrinkage and Selection Operator and causality-based feature selection. I demonstrate an application of feature selection to discover the optimal heatwave definition for economic outcomes, enhancing our understanding of extreme temperatures’ impact on the economy. I argue that the literature in computer science can provide useful insights in studies concerned with climate change as well as its economic outcomes.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. Hourly temperature data for June 21, 2019 obtained from ERA5 data set provided by European Centre for Medium-Range Weather Forecasts. Panel (a) provides a color map showing the maximum temperatures observed on this day in each grid cell. Panel (b) shows grid cells that have temperatures above 35$ {}^o $C in red, and temperatures below 35$ {}^o $C in gray.

Figure 1

Figure 2. Hypothetical directed acyclic graph (DAG) for weather events and GDP per capita.

Figure 2

Figure 3. Markov blanket (MB) of WE$ {}_1 $. In this example, temperature and WE$ {}_2 $ are the parents of WE$ {}_1 $, GDP per capita is the child and WE$ {}_{M-1} $ is the spouse. The dashed arrows are not part of the MB of WE$ {}_1 $, but they would be part of the MB of temperature.

Figure 3

Table 1. Definitions of heatwaves

Figure 4

Figure 4. Heatwaves in 2012––comparison of definitions requiring 6 consecutive days.

Figure 5

Figure 5. Heatwaves in 2012––comparison of definitions requiring 3 consecutive days.

Figure 6

Table 2. Summary statistics of GDP growth before and after trimming

Figure 7

Figure 6. GDP Growth in US counties between 2018 and 2019. Each panel shows the first difference of the $ \Delta \mathit{\log}{\left( Personal\ Income\; per\; Capita\right)}_t $ (in 2015 terms) for $ t=2019 $ before and after trimming the upper and lower 1 percentile.

Supplementary material: File

Akyapı supplementary material
Download undefined(File)
File 168.2 KB
Supplementary material: PDF

Akyapı supplementary material

Akyapı supplementary material

Download Akyapı supplementary material(PDF)
PDF 168.2 KB