Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-06T08:10:14.533Z Has data issue: false hasContentIssue false

A comprehensive survey on advertising click-through rate prediction algorithm

Published online by Cambridge University Press:  21 May 2025

Jing Bai
Affiliation:
Southwest Petroleum University, Chengdu, China
Xinyu Geng*
Affiliation:
Southwest Petroleum University, Chengdu, China
Jiaqi Deng
Affiliation:
Southwest Institute of Electronic Technology, Chengdu, China
Zhen Xia
Affiliation:
Southwest Petroleum University, Chengdu, China
Hongxia Jiang
Affiliation:
Southwest Petroleum University, Chengdu, China
Guoqiang Yan
Affiliation:
Southwest Petroleum University, Chengdu, China
Jing Liang
Affiliation:
Southwest Petroleum University, Chengdu, China
*
Corresponding author: Xinyu Geng; Email: gengxy123@126.com
Rights & Permissions [Opens in a new window]

Abstract

Advertising click-through rate (CTR) prediction is a fundamental task in recommender systems, aimed at estimating the likelihood of users interacting with advertisements based on their historical behavior. This prediction process has evolved through two main stages: from traditional shallow interaction models to more advanced deep learning approaches. Shallow models typically operate at the level of individual features, failing to fully leverage the rich, multilevel information available across different feature sets, leading to less accurate predictions. In contrast, deep learning models exhibit superior feature representation and learning capabilities, enabling a more realistic simulation of user interactions and improving the accuracy of CTR prediction. This paper provides a comprehensive overview of CTR prediction algorithms in the context of recommender systems. The algorithms are categorized into two groups: shallow interactive models and deep learning-based prediction models, including deep neural networks, convolutional neural networks, recurrent neural networks, and graph neural networks. Additionally, this paper also discusses the advantages and disadvantages of the aforementioned algorithms, as well as the benchmark datasets and model evaluation methods used for CTR prediction. Finally, it identifies potential future research directions in this rapidly advancing field.

Information

Type
Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Overall framework of convolutional neural network

Figure 1

Figure 2. Recurrent neural network

Figure 2

Figure 3. Instance of graph

Figure 3

Figure 4. A general framework for graph embedding

Figure 4

Table 1. Classification and representative literature of click-through rate prediction models

Figure 5

Table 2. Commonly used notations

Figure 6

Figure 5. Structure diagram of logistic regression model

Figure 7

Figure 6. Structure diagram of LS-PLM

Figure 8

Figure 7. Structure diagram of product layer

Figure 9

Figure 8. Structure diagram of Wide & Deep

Figure 10

Figure 9. Visualization of cross layer

Figure 11

Figure 10. Structure of activation unit

Figure 12

Table 3. Summary of the representative DNN based ad click-through rate prediction model. Specifically, $\boldsymbol{X}$, $\boldsymbol{X}_u$, $\boldsymbol{X}_b$, $\boldsymbol{X}_{cont}$, $\boldsymbol{X}_t$ and $\boldsymbol{X}_{n}$ represent the input feature vector containing multiple fields, the user, the user behavior, the context, the target ad and the negative ad respectively. ‘+’ in the Model Framework indicates that the two models are combined in parallel, and ‘$\rightarrow$’ indicates transmission. Missing values in the table are represented by ‘-’

Figure 13

Figure 11. Basic architecture of applying CNN to CTR prediction

Figure 14

Table 4. Summary of the representative CNN-based ad click-through rate prediction model. Specifically, $\boldsymbol{X}$, $\boldsymbol{X}_A$, $\boldsymbol{X}_Q$, $\boldsymbol{X}_u$, and $\boldsymbol{X}_{cont}$ represent the input feature vector containing multiple fields, the ad, the query, the user, and the context, respectively. In the Pooling column, p-max, MOR, and max & avg represent flexible p-max pooling, mean-overtime region pooling, and max and average pooling, respectively. Missing values in the table are represented by ‘-’

Figure 15

Figure 12. General framework of FGCNN model

Figure 16

Figure 13. RNN training process with BPTT algorithm

Figure 17

Table 5. Summary of the representative RNN based ad click-through rate prediction model. Specifically, $\boldsymbol{X}$, $\boldsymbol{X}_u$, $\boldsymbol{X}_b$, $\boldsymbol{X}_t$, and $\boldsymbol{X}_{cont}$ represent the input feature vector containing multiple fields, the user features, the user behavior features, the target ad features, and the context features, respectively. Att, Multi-Att, and M-Self-Att in the Attention indicate that attention, multi-head attention, and multi-head self-attention, respectively. Missing values in the table are represented by ‘-’

Figure 18

Figure 14. Structure of co-occurrence commodity graph

Figure 19

Table 6. Summary of the representative GNN based ad click-through rate prediction model. Specifically, $\boldsymbol{X}$, $\boldsymbol{X}_u$, $\boldsymbol{X}_b$, $\boldsymbol{X}_t$ and $\boldsymbol{X}_{cont}$ represent the input feature vector containing multiple fields, the user features, the user behavior features, the target ad features and the context features, respectively. Att and M-Self-Att in the Attention Mechanism indicates that attention and multihead self-attention, respectively. Missing values in the table are represented by ‘-’

Figure 20

Table 7. Advantages and disadvantages of CTR prediction algorithms based on shallow interactive model, DNN, CNN, RNN, and GNN

Figure 21

Table 8. The summary of datasets for advertising click-through rate prediction model

Figure 22

Table 9. Evaluation metrics for CTR prediction model