Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-06T12:41:19.390Z Has data issue: false hasContentIssue false

Vine variety identification through leaf image classification: a large-scale study on the robustness of five deep learning models

Published online by Cambridge University Press:  12 February 2024

D. De Nart
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Engineering and Agro-Food Processing, via Giacomo Venezian, 26, Milano, Italy
M. Gardiman
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
V. Alba
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
L. Tarricone
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
P. Storchi
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
S. Roccotelli
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
M. Ammoniaci
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
V. Tosi
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
R. Perria
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
R. Carraro*
Affiliation:
Council for Agricultural Research and Economics, Research Centre for Viticulture and Enology, via XXVIII Aprile, 26, Conegliano, Italy
*
Corresponding author: R. Carraro; Email: roberto.carraro@crea.gov.it
Rights & Permissions [Opens in a new window]

Abstract

Varietal identification plays a pivotal role in viticulture for several purposes. Nowadays, such identification is accomplished using ampelography and molecular markers, techniques requiring specific expertise and equipment. Deep learning, on the other hand, appears to be a viable and cost-effective alternative, as several recent studies claim that computer vision models can identify different vine varieties with high accuracy. Such works, however, limit their scope to a handful of selected varieties and do not provide accurate figures for external data validation. In the current study, five well-known computer vision models were applied to leaf images to verify whether the results presented in the literature can be replicated over a larger data set consisting of 27 varieties with 26 382 images. It was built over 2 years of dedicated field sampling at three geographically distinct sites, and a validation data set was collected from the Internet. Cross-validation results on the purpose-built data set confirm literature results. However, the same models, when validated against the independent data set, appear unable to generalize over the training data and retain the performances measured during cross validation. These results indicate that further enhancement have been done in filling such a gap and developing a more reliable model to discriminate among grape varieties, underlining that, to achieve this purpose, the image resolution appears to be a crucial factor in the development of such models.

Information

Type
Crops and Soils Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Class cardinality proportions within the data set.

Figure 1

Figure 2. Inception Net V3 confusion matrix on the cross-validated data.

Figure 2

Figure 3. EfficientNet B5 confusion matrix on the cross-validated data.

Figure 3

Table 1. Mean and standard deviation of the accuracy, the number of true positives over the total number of considered predictions, for the cross validation

Figure 4

Figure 4. Positioning of classes with respect to measured Precision and Recall.

Figure 5

Table 2. Measure of the overall prediction quality (F1 scores) of considered varieties achieved by the models

Figure 6

Figure 5. Vector space learned by the Inception Net V3 model.

Figure 7

Table 3. Model Accuracy scores including the external Kaggle data set

Figure 8

Figure 6. Two samples of Trebbiano toscano (a), Pinot noir (b), and Sangiovese with remarkably different visual features.