Hostname: page-component-89b8bd64d-46n74 Total loading time: 0 Render date: 2026-05-06T12:35:20.288Z Has data issue: false hasContentIssue false

Managing streamed sensor data for mobile equipment prognostics

Published online by Cambridge University Press:  06 April 2022

Toby Griffiths
Affiliation:
System Health Lab, Department of Mechanical Engineering, University of Western Australia, Crawley, Western Australia 6009, Australia
Débora Corrêa*
Affiliation:
Department of Computer Science and Software Engineering, University of Western Australia, Crawley, Western Australia 6009, Australia ARC Industrial Transformation Training Centre (Transforming Maintenance through Data Science), University of Western Australia, Crawley, Western Australia 6009, Australia
Melinda Hodkiewicz
Affiliation:
System Health Lab, Department of Mechanical Engineering, University of Western Australia, Crawley, Western Australia 6009, Australia ARC Industrial Transformation Training Centre (Transforming Maintenance through Data Science), University of Western Australia, Crawley, Western Australia 6009, Australia
Adriano Polpo
Affiliation:
System Health Lab, Department of Mechanical Engineering, University of Western Australia, Crawley, Western Australia 6009, Australia Department of Computer Science and Software Engineering, University of Western Australia, Crawley, Western Australia 6009, Australia ARC Industrial Transformation Training Centre (Transforming Maintenance through Data Science), University of Western Australia, Crawley, Western Australia 6009, Australia
*
*Corresponding author. E-mail: debora.correa@uwa.edu.au

Abstract

The ability to wirelessly stream data from sensors on heavy mobile equipment provides opportunities to proactively assess asset condition. However, data analysis methods are challenging to apply due to the size and structure of the data, which contain inconsistent and asynchronous entries, and large periods of missing data. Current methods usually require expertise from site engineers to inform variable selection. In this work, we develop a data preparation method to clean and arrange this streaming data for analysis, including a data-driven variable selection. Data are drawn from a mining industry case study, with sensor data from a primary production excavator over a period of 9 months. Variables include 58 numerical sensors and 40 binary indicators captured in 45-million rows of data describing the conditions and status of different subsystems of the machine. A total of 57% of time stamps contain missing values for at least one sensor. The response variable is drawn from fault codes selected by the operator and stored in the fleet management system. Application to the hydraulic system, for 21 failure events identified by the operator, shows that the data-driven selection contains variables consistent with subject matter expert expectations, as well as some sensors on other systems on the excavator that are less easy to explain from an engineering perspective. Our contribution is to demonstrate a compressed data representation using open-high-low-close and variable selection to visualize data and support identification of potential indicators of failure events from multivariate streamed data.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. An extract from the raw sensor data showing the structure and asynchronous sensor recording.

Figure 1

Figure 1. An extract of the raw data, with a region of missing data highlighted by the vertical red line.

Figure 2

Table 2. The proposed baseline data frame. After removing rows containing Not a Number (NAN) values, the final dimension of the data is 3,591 × 35. The total number of events is 31 for the response variable given by the fleet management data.

Figure 3

Table 3. The sensor data extract presented in Table 1 after pivoting, showing several unnecessary missing values.

Figure 4

Table 4. The sensor data extract after preprocessing steps and aligned with event data.

Figure 5

Figure 2. The open-high-low-close data (top) retain the important features of the raw data (bottom).

Figure 6

Figure 3. A representation of a lag data model for one sensor, using a window of 5 hr. The red line indicates the time of failure. The last 5 hr of each sensor before the failure are used as variables in the lagged model. As we do not use overlap between the windows, this process results in a data frame with dimension 1,093 × 107, which is further reduced to 581 × 107 after removing missing rows.

Figure 7

Table 5. $ \beta $s, odds ratio, and 95% confidence interval for the odds ratio from LASSO logistic regression variable selection with the fleet management response variable. $ \mathrm{Loss}\left(\boldsymbol{\theta} |\mathcal{D}\right)=211.85 $, $ \lambda =10 $.

Figure 8

Figure 4. Functional blocks in the hydraulic system.

Figure 9

Figure 5. Physical layout of the hydraulic pumps, pump transmission system, and engines.

Figure 10

Figure 6. A comparison over time of hydraulic faults recorded in the fleet management system with corrective maintenance work orders. The green lines indicate dates for which the events match in both systems. The blue lines indicate event dates for the individual systems. The orange lines indicate when an event in the fleet management system had a work order in the next 7 days. The data correspond to hourly open-high-low-close.

Submit a response

Comments

No Comments have been published for this article.