Hostname: page-component-6766d58669-nqrmd Total loading time: 0 Render date: 2026-05-20T06:58:47.057Z Has data issue: false hasContentIssue false

Robust pollination for tomato farming using deep learning and visual servoing

Published online by Cambridge University Press:  13 November 2024

Rajmeet Singh
Affiliation:
Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, UAE
Lakmal Seneviratne
Affiliation:
Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, UAE
Irfan Hussain*
Affiliation:
Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, UAE
*
Corresponding author: Irfan Hussain; Email: irfan.hussain@ku.ac.ae
Rights & Permissions [Opens in a new window]

Abstract

In this work, we propose a novel approach for tomato pollination that utilizes visual servo control. The objective is to meet the growing demand for automated robotic pollinators to overcome the decline in bee populations. Our approach focuses on addressing this challenge by leveraging visual servo control to guide the pollination process. The proposed method leverages deep learning to estimate the orientations and depth of detected flower, incorporating CAD-based synthetic images to ensure dataset diversity. By utilizing a 3D camera, the system accurately estimates flower depth information for visual servoing. The robustness of the approach is validated through experiments conducted in a laboratory environment with a 3D printed tomato flower plant. The results demonstrate a high detection rate, with a mean average precision of 91.2 %. Furthermore, the average depth error for accurately localizing the pollination target is impressively minimal, measuring only 1.1 cm. This research presents a promising solution for tomato pollination, showcasing the effectiveness of visual-guided servo control and its potential to address the challenges posed by diminishing bee populations in greenhouses.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. (a) Tomato flower structure, (b) bee pollination, (c) hand brush pollination, and (d) vibrator toothbrush pollination.

Figure 1

Figure 2. (a) CAD model of pollinator and (b) prototype model of pollinator mounted on UR5 robot.

Figure 2

Figure 3. Universal robot (UR5) joint configuration and Denavit–Hartenberg (DH) parameters.

Figure 3

Figure 4. Control visual servoing loops.

Figure 4

Figure 5. Two PID controllers scheme to effectively coordinate the robot’s approach, alignment, and orientation for successful pollination.

Figure 5

Figure 6. Components of the image feedback visual-servoing system.

Figure 6

Figure 7. Flower orientation labelling based on ITFP annotation wheel system.

Figure 7

Figure 8. (a) Robot wrist (position and orientation) for pollination control, (b) Class (0,255), (c) Class (0,270), and (d) Class (0,315).

Figure 8

Table I. Flower orientation classes with desired robot wrist angle values.

Figure 9

Figure 9. Camera model for flower depth calculation.

Figure 10

Figure 10. Camera-robot calibration (a) Aruco marker, (b) hand-to-eye camera robot calibration model, (c) Aruco marker visualization in RVIZ, and (d) Camera-aruco marker hand to eye calibration.

Figure 11

Figure 11. Flow diagram of ITFP workflow system.

Figure 12

Figure 12. Distribution graph for all the classes in the flower dataset (roboflow and synthetic data).

Figure 13

Figure 13. Example of labeling the dataset (a) roboflow dataset and (b) CAD synthetic dataset).

Figure 14

Table II. Amount of data for respective classes values.

Figure 15

Figure 14. YOLOv8 diagram of model architecture.

Figure 16

Table III. Performance evaluation comparison with different backbone architectures.

Figure 17

Table IV. Mean average precision (mAP) values.

Figure 18

Figure 15. Some examples of correctly classified results (a) real flower images dataset, and (b) CAD generated synthetic dataset.

Figure 19

Figure 16. Inference time vs batch size.

Figure 20

Table V. Comparison of YOLOv8n model weight size with different backbone architectures models.

Figure 21

Table VI. Comparison of orientation detection accuracies and recall: Our approach versus Strader et. al (2019) [8].

Figure 22

Figure 17. Average performance parameters of proposed model (a) precision, (b) recall, (c) mAP 0.5, and (d) loss.

Figure 23

Figure 18. Flower orientation with depth estimation.

Figure 24

Figure 19. 3D trajectory and translational velocity: starting and target orientations for the pollination process.

Figure 25

Figure 20. 3D trajectory and translational velocity: starting and target orientations for the pollination process.

Figure 26

Figure 21. (a) Sequential camera view images with corresponding depth information, and (b) Pollination done on 3D printed tomato flower plant.