Hostname: page-component-77c78cf97d-sp94z Total loading time: 0 Render date: 2026-04-24T16:33:15.880Z Has data issue: false hasContentIssue false

Exploration-exploitation-based trajectory tracking of mobile robots using Gaussian processes and model predictive control

Published online by Cambridge University Press:  30 June 2023

Hannes Eschmann
Affiliation:
Institute of Engineering and Computational Mechanics, University of Stuttgart, Stuttgart, Germany
Henrik Ebel
Affiliation:
Institute of Engineering and Computational Mechanics, University of Stuttgart, Stuttgart, Germany
Peter Eberhard*
Affiliation:
Institute of Engineering and Computational Mechanics, University of Stuttgart, Stuttgart, Germany
*
Corresponding author: Peter Eberhard; Email: peter.eberhard@itm.uni-stuttgart.de
Rights & Permissions [Opens in a new window]

Abstract

Mobile robots are a key component for the automation of many tasks that either require high precision or are deemed too hazardous for human personnel. One of the typical duties for mobile robots in the industrial sector is to perform trajectory tracking, which involves pursuing a specific path through both space and time. In this paper, an iterative learning-based procedure for highly accurate tracking is proposed. This contribution shows how data-based techniques, namely Gaussian process regression, can be used to tailor a motion model to a specific reoccurring reference. The procedure is capable of explorative behavior meaning that the robot automatically explores states around the prescribed trajectory, enriching the data set for learning and increasing the robustness and practical training accuracy. The trade-off between highly accurate tracking and exploration is done automatically by an optimization-based reference generator using a suitable cost function minimizing the posterior variance of the underlying Gaussian process model. While this study focuses on omnidirectional mobile robots, the scheme can be applied to a wide range of mobile robots. The effectiveness of this approach is validated in meaningful real-world experiments on a custom-built omnidirectional mobile robot where it is shown that explorative behavior can outperform purely exploitative approaches.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The custom-built omnidirectional mobile robot platform (left) and its special Mecanum wheels (right).

Figure 1

Figure 2. The set $\mathcal{A}$ for the input $\boldsymbol{{u}}_t$ (its first three components) with the samples used for the training procedure in red. The set for the corresponding input $\boldsymbol{{u}}_{t-1}$ and the input rate $\boldsymbol{{u}}_t-\boldsymbol{{u}}_{t-1}$, while conceptually similar, are not depicted.

Figure 2

Figure 3. Selection of some of the training trajectories used for the data generation. The orientation of the robot is not depicted.

Figure 3

Figure 4. Visualization of the proposed iterative procedure.

Figure 4

Table I. Different modifications to the optimization problem (23) to obtain a periodic state and input reference.

Figure 5

Table II. The kernel hyperparameters for the three different GPs approximating the function $\boldsymbol{{h}}$.

Figure 6

Figure 5. For the hardware experiments, the robot drives five consecutive laps for each model on the $\infty$-shaped trajectory. The errors to the desired trajectory $\boldsymbol{{x}}^{\text{des}}_t$ (dashed line) are shown using a magnification of 10 for better visibility. The orientation of the robot is not depicted.

Figure 7

Figure 6. The RMSE for the position of the robot as well as the term $L_\Sigma$ for the $\infty$-shaped trajectory. The initial ($0{\text{th}}$) is the nominal solution, and the first iteration is the solution corresponding to no additional trajectory-specific information, that is, ${\boldsymbol{{f}}_\boldsymbol{{g}}}$.

Figure 8

Figure 7. Error for the deviations from the desired $\infty$-shaped trajectory $\boldsymbol{{x}}^{\textrm{des}}_t-\boldsymbol{{x}}_t$ on position level (left) and orientation level (right) using the three different robot models.

Figure 9

Figure 8. The RMSE for the position of the robot as well as the term $L_\Sigma$ with $\gamma =0$ and $\gamma =0.01$ for the rectangular trajectory. The initial ($0{\text{th}}$) is the nominal solution, and the first iteration is the solution corresponding to no additional trajectory-specific information, that is, ${\boldsymbol{{f}}_\boldsymbol{{g}}}$. The seventh and final iteration for the exploration-based approach $\gamma =0$ (pure exploitation) was used.

Figure 10

Figure 9. Final tracking results for the rectangle trajectory using different robot models. The errors to the desired trajectory $\boldsymbol{{x}}^{\text{des}}_t$ (dashed line) are shown using a magnification of 25 for better visibility. For ${\boldsymbol{{f}}_\boldsymbol{{gh}}}$, only the iterations with the lowest position error are depicted.

Figure 11

Figure 10. Average computational time for solving (23) for each iteration using different values of $\gamma$. The times were recorded for the rectangular trajectory ($N=197$). The first iteration is the solution corresponding to no additional trajectory-specific information, that is, ${\boldsymbol{{f}}_\boldsymbol{{g}}}$.