Hostname: page-component-89b8bd64d-x2lbr Total loading time: 0 Render date: 2026-05-08T17:06:22.859Z Has data issue: false hasContentIssue false

Using Neural Network Models for Wine Review Classification

Published online by Cambridge University Press:  07 April 2022

Duwani Katumullage
Affiliation:
Department of Statistical Science, Southern Methodist University, Dallas, Texas, 75275; e-mail: dkatumullage@smu.edu.
Chenyu Yang
Affiliation:
Department of Statistical Science, Southern Methodist University, Dallas, Texas, 75275; e-mail: chenyuy@smu.edu.
Jackson Barth
Affiliation:
Department of Statistical Science, Southern Methodist University, Dallas, Texas, 75275; e-mail: jbarth@smu.edu.
Jing Cao*
Affiliation:
Department of Statistical Science, Southern Methodist University, Dallas, Texas, 75275
*
e-mail: jcao@smu.edu (corresponding author).
Rights & Permissions [Opens in a new window]

Abstract

Wines are usually evaluated by wine experts and enthusiasts who give numeric ratings as well as text reviews. While most wine classification studies have been based on conventional statistical models using numeric variables, there has been very limited work on implementing neural network models using wine reviews. In this paper, we apply neural network models (CNN, BiLSTM, and BERT) to extract useful information from wine reviews and classify wines according to different rating classes. Using a large collection of wine reviews from Wine Spectator, the study shows that BERT, a neural network framework recently developed by Google, has the best performance. In the two-class classification (90–100 and 80–89), BERT achieves an accuracy of 89.12%, followed by BiLSTM (88.69%) and CNN (88.02%). In the four-class classification (95–100, 90–94, 85–89, and 80–84), BERT yields an 81.57% accuracy, while the other two produce an 80% accuracy. The neural network models in the paper are independent of domain knowledge and thus can be easily extended to other kinds of text analysis. Expanding the limited work on wine text review classification studies, these models are up-to-date and provide valuable additions to wine data analysis. (JEL Classifications: C45, C88, D83)

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of American Association of Wine Economists
Figure 0

Figure 1 Word Cloud of the Review Corpus

Figure 1

Table 1 Proportion of Reviews in Each Class for the Two-Class and Four-Class Classifications

Figure 2

Figure 2 CNN/BiLSTM PipelineSource: Kowsari et al. (2019).

Figure 3

Figure 3 BERT PipelineSource: Haren (2019).

Figure 4

Table 2 Classification Contingency Table

Figure 5

Table 3 Accuracy, Precision, Recall, and F-1 Score of the Models