Hostname: page-component-89b8bd64d-z2ts4 Total loading time: 0 Render date: 2026-05-07T18:17:28.374Z Has data issue: false hasContentIssue false

The use of autoencoders for training neural networks with mixed categorical and numerical features

Published online by Cambridge University Press:  24 April 2023

Łukasz Delong*
Affiliation:
SGH Warsaw School of Economics, Institute of Econometrics, Warsaw, Poland
Anna Kozak
Affiliation:
Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
*
Corresponding author: Łukasz Delong; Email: lukasz.delong@sgh.waw.pl
Rights & Permissions [Opens in a new window]

Abstract

We focus on modelling categorical features and improving predictive power of neural networks with mixed categorical and numerical features in supervised learning tasks. The goal of this paper is to challenge the current dominant approach in actuarial data science with a new architecture of a neural network and a new training algorithm. The key proposal is to use a joint embedding for all categorical features, instead of separate entity embeddings, to determine the numerical representation of the categorical features which is fed, together with all other numerical features, into hidden layers of a neural network with a target response. In addition, we postulate that we should initialize the numerical representation of the categorical features and other parameters of the hidden layers of the neural network with parameters trained with (denoising) autoencoders in unsupervised learning tasks, instead of using random initialization of parameters. Since autoencoders for categorical data play an important role in this research, they are investigated in more depth in the paper. We illustrate our ideas with experiments on a real data set with claim numbers, and we demonstrate that we can achieve a higher predictive power of the network.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of The International Actuarial Association
Figure 0

Table 1. Features used in our experiments.

Figure 1

Figure 1. Architecture of the autoencoder for numerical features used in the paper.

Figure 2

Figure 2. Architecture of the autoencoder of type Separate AEs for categorical features.

Figure 3

Figure 3. Architecture of the autoencoder of type Joint AE for categorical features.

Figure 4

Figure 4. Cosine similarity measures for autoencoders for categorical data.

Figure 5

Figure 5. Architecture of type A1 with separate entity embeddings.

Figure 6

Figure 6. Architecture of type A2 with joint embedding.

Figure 7

Figure 7. Distributions of the Poisson loss on the test set (for each network the dotted line represents the average loss in 100 calibrations).

Figure 8

Table 2. Distributions of the poisson loss on the test set, their quantiles (q), average values (avg) and standard deviations (SD).

Supplementary material: PDF

Delong and Kozak supplementary material

Delong and Kozak supplementary material 1

Download Delong and Kozak supplementary material(PDF)
PDF 229.9 KB