Hostname: page-component-5db58dd55d-qmkzp Total loading time: 0 Render date: 2026-05-31T01:48:41.180Z Has data issue: false hasContentIssue false

A robust generalized deep monotonic feature extraction model for label-free prediction of degenerative phenomena

Published online by Cambridge University Press:  04 February 2026

Panagiotis Komninos
Affiliation:
Aerospace Structures and Materials, Faculty of Aerospace Engineering, Delft University of Technology , Delft, The Netherlands
Thanos Kontogiannis
Affiliation:
Aerospace Structures and Materials, Faculty of Aerospace Engineering, Delft University of Technology , Delft, The Netherlands
Nick Eleftheroglou
Affiliation:
Aerospace Structures and Materials, Faculty of Aerospace Engineering, Delft University of Technology , Delft, The Netherlands
Dimitrios Zarouchas*
Affiliation:
Aerospace Structures and Materials, Faculty of Aerospace Engineering, Delft University of Technology , Delft, The Netherlands
*
Corresponding author: Dimitrios Zarouchas; Email: d.zarouchas@tudelft.nl

Abstract

Addressing and predicting degenerative phenomena in domains such as health care and engineering, two fundamental fields of vital importance for society, offers valuable insights into early warning steps and critical event forecasting, leading to far-reaching implications for safety and resource allocation. By harnessing the power of data-driven insights, prognostics becomes the principal component of predicting such phenomena. Developing clustering techniques as feature extractors acts as an intermediate step between the raw incoming data and prognostics and provides the opportunity to unveil hidden relationships within complex datasets. However, when limited, noisy, and multimodal data are available in a label-free format, extensive preprocessing, and unreliable, complicated models are required for extracting meaningful features. This prohibits the development of adaptable methods in diverse domains that are in favor of robustness and interpretability. In this regard, this study introduces a novel unsupervised deep clustering model for feature extraction in degenerative phenomena. The model innovatively extracts prognostic-related features from raw data via clustering analysis, characterized by an increasing monotonic behavior representing system deterioration. This monotonicity is partial rather than complete, to incorporate the potential occurrence of oscillations in the degradation trajectory of the system or noise-related data, reflecting real-world scenarios. Its performance, robustness, generalizability, and interpretability are evaluated across diverse domains utilizing three datasets from health care and engineering featuring limited, noisy, high-dimensional, and multimodal raw signals. Results show that the model extracts meaningful prognostic-related features in both domains and all datasets, without a significant alteration in its architecture and independently of the chosen prognostic algorithm.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. The concept of the proposed methodology.

Figure 1

Table 1. List of variables extracted by the MIMIC-III dataset

Figure 2

Table 2. List of variables extracted by the C-MAPSS dataset

Figure 3

Table 3. The low-level features that are considered and extracted by the AMSY-6 Vallen Systeme GmbH

Figure 4

Table 4. General characteristics of the F-MOC dataset

Figure 5

Figure 2. (a) Module-level architecture of the proposed AE model. (b) Detailed architecture of the proposed AE model. (c) Detailed architecture of the DSMC model used for soft monotonic clustering.

Figure 6

Figure 3. Redesigned architecture of the model regarding the F-MOC dataset.

Figure 7

Table 5. Hyperparameters’ values for the LSTM and CNN3D layers

Figure 8

Table 6. Hyperparameter search ranges and final values optimized by the Bayesian optimization algorithm for each dataset

Figure 9

Figure 4. (a) Clustering results for the MIMIC-III dataset. (b) Kaplan–Meier curves for the MIMIC-III dataset (“h” stands for “hours”). (c) Clustering results for the C-MAPSS dataset. (d) Kaplan–Meier curves for the C-MAPSS dataset (“c” stands for “cycles”).

Figure 10

Figure 5. ROC curves after 1, 25, 50, and 80 h after each patient’s entrance to the ICU.

Figure 11

Figure 6. RUL prediction of different prognostic models for one inner and one outlier trajectory. (a) Testing engine 5, whose length is close to average. (b) Testing engine 10, representing the right outlier.

Figure 12

Table 7. Comparison table for the MIMIC-III dataset based on standard metrics between our proposed DSMC model and three widely used in healthcare scoring systems, including SOFA, SAPS III, and APACHE II

Figure 13

Figure 7. (a) Constructed HIs utilizing the LSTMCAE model. (b) Comparison of RUL curves between DSMC and LSTMCAE for test engine 5. (c) Comparison of RUL curves between DSMC and LSTMCAE for test engine 5.

Figure 14

Table 8. RMSE for each test engine calculated by the HSMM model (ours) and the LSTMCAE model

Figure 15

Figure 8. Time gradients (gradients of each encoder’s output with respect to the time feature). (a) Time gradients for the MIMIC-III dataset. The time gradients for $ {z}_1 $ are zoomed in for clarity. (b) Time gradients for the C-MAPSS dataset. The time gradients for $ {z}_2 $ are zoomed in for clarity.

Figure 16

Figure 9. (a) Z-space visualization for the C-MAPSS dataset before training the DSMC model. (b) Z-space visualization for the C-MAPSS dataset after training the DSMC model. (c) t-SNE graph with two principal components for the C-MAPSS dataset before training the DSMC model. (d) t-SNE graph with two principal components for the C-MAPSS dataset after training the DSMC model.

Figure 17

Figure 10. (a) Z-space visualization for the MIMIC-III dataset before training the DSMC model. (b) Z-space visualization for the MIMIC-III dataset after training the DSMC model. (c) t-SNE graph for the MIMIC-III dataset before training the DSMC model. (d) t-SNE graph for the MIMIC-III dataset after training the DSMC model.

Figure 18

Figure 11. (a) Clustering results for the F-MOC dataset. (b) Z-space visualization for the F-MOC dataset after training the DSMC model.

Figure 19

Figure 12. (a) Stochastic RUL predictions of the first test specimen. (b) Stochastic RUL predictions of the second test specimen.

Figure 20

Figure 13. Qualitative analysis of the number of clusters hyperparameter for (a) MIMIC-III dataset, (b) C-MAPSS dataset, and (c) F-MOC dataset.

Figure 21

Figure A1. Survivability rates of a test subset of the patients at different time steps. The time steps are chosen at 0%, 25%, 50%, and 75% of the corresponding patient’s true time of stay in the ICU until mortality.

Figure 22

Table A1. Reproducibility of the training process

Figure 23

Figure A2. Reliability curves of a test subset of the engines at different time steps. The time steps are chosen at 0%, 25%, 50%, and 75% of the corresponding engine’s true lifespan.

Figure 24

Figure A3. The convergence of train and validation losses, including the reconstruction loss of the input and reconstruction loss of the time feature, for the three datasets after the first stage of training of the DSMC model (AE training). (a, b) MIMIC-III dataset. (c, d) C-MAPSS dataset. (e, f) F-MOC dataset.

Figure 25

Figure B1. Two examples of image snapshots taken from a camera representing one specimen subject to fatigue loads. (a) Healthy. (b) Severely damaged.

Figure 26

Figure B2. An example of a specimen’s low-level features representing amplitude, duration, and energy, as extracted from the AMSY-6 Vallen Systeme GmbH across its lifetime. These features are displayed along the $ y $-axis, while the hit time feature is depicted along the $ x $-axis. Details of each low-level feature can be found in Table 3.

Submit a response

Comments

No Comments have been published for this article.