Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-03-28T17:03:45.649Z Has data issue: false hasContentIssue false

Hierarchical Gaussian processes for characterizing gait variability in multiple sclerosis

Published online by Cambridge University Press:  07 August 2025

Alexandru Stihi
Affiliation:
Department of Mechanical Engineering, Insigneo Institute for In Silico Medicine, University of Sheffield, Sheffield, UK Dynamics Research Group, Department of Mechanical Engineering, University of Sheffield, Sheffield, UK
Claudia Mazzà
Affiliation:
Department of Mechanical Engineering, Insigneo Institute for In Silico Medicine, University of Sheffield, Sheffield, UK
Elizabeth Cross
Affiliation:
Dynamics Research Group, Department of Mechanical Engineering, University of Sheffield, Sheffield, UK
Timothy James Rogers*
Affiliation:
Dynamics Research Group, Department of Mechanical Engineering, University of Sheffield, Sheffield, UK
*
Corresponding author: Timothy James Rogers; Email: tim.rogers@sheffield.ac.uk

Abstract

Reduction in mobility due to gait impairment is a critical consequence of diseases affecting the neuromusculoskeletal system, making detecting anomalies in a person’s gait a key area of interest. This challenge is compounded by within-subject and between-subject variability, further emphasized in individuals with multiple sclerosis (MS), where gait patterns exhibit significant heterogeneity. This study introduces a novel perspective on modeling kinematic gait patterns, recognizing the inherent hierarchical structure of the data, which is gathered from contralateral limbs, individuals, and groups of individuals comprising a population, using wearable sensors. Rather than summarizing features, this approach models the entire gait cycle functionally, including its variation. A Hierarchical Variational Sparse Heteroscedastic Gaussian Process was used to model the shank angular velocity across 28 MS and 28 healthy individuals. The utility of this methodology was underscored by its granular analysis capabilities. This facilitated a range of quantifiable comparisons, spanning from group-level assessments to patient-specific analyses, addressing the complexity of pathological gait patterns and offering a robust methodology for kinematic pattern characterization for large datasets. The group-level analysis highlighted notable differences during the swing phase and towards the end of the stance phase, aligning with previously established literature findings. Moreover, the study identified the heteroscedastic gait pattern variability as a distinguishing feature of MS gait. Additionally, a novel approach for lower limb gait asymmetry quantification has been proposed. The use of probabilistic hierarchical modeling facilitated a better understanding of the impaired gait pattern, while also expressing potential for extrapolation to other pathological conditions affecting gait.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Comparison of shank angular velocity data between healthy controls (HC) and individuals with multiple sclerosis (MS). The figure presents an aggregate of data points collected using inertial measurement units from both left and right limbs, encompassing 7899 gait cycles from 28 healthy controls and 7105 gait cycles from 28 individuals affected by multiple sclerosis.

Figure 1

Figure 2. An illustration of a simple hierarchical GP. Top: solid line—a single sample from the prior over the underlying group function $ {f}_g(x) $. Dotted line—zero-mean function. Shaded area: $ \pm 1 $ standard deviation of functions, $ {\sigma}_g^2 $. Middle: three samples conditioned on $ {f}_g(x) $ and corresponding to three distinct individuals. The individual samples follow the trend of $ {f}_g(x) $, but vary by a small amount, $ {\sigma}_i^2 $. The length-scale of the group and individual functions are denoted by $ {l}_g $ and $ {l}_i $, respectively. Bottom: block-wise covariance matrix used to generate samples.

Figure 2

Table 1. Demographics table

Figure 3

Figure 3. Hierarchical modeling structure. From left to right: population layer, group layer (HC/MS), individual layer (combining the individual left and right limbs), individual limb layer (modeling the left and right limbs separately).

Figure 4

Figure 4. Group-level GP predictions against test data for homoscedastic models (first row) and heteroscedastic models (second row). (a) HC Homoscedastic model, (b) HC Heteroscedastic model, (c) Homoscedastic MS model, (d) Heteroscedastic MS Model.

Figure 5

Figure 5. Mean and variance differences between the homoscedastic and heteroscedastic models.

Figure 6

Table 2. Comparison of the performance metrics for the homoscedastic and heteroscedastic group models

Figure 7

Figure 6. Left: GP group predictions, Right: Samples drawn from the GPs. The vertical dotted lines, from left to right, correspond to the group-averaged mid-stance, toe-off and mid-swing events. Toe-off events were defined as the time point where the minimum negative peak occurs immediately before the maximum positive peak during each stride. The mid-stance point corresponds to the halfway point between the start of the gait cycle and the toe-off event, whereas the mid-swing corresponds to the maximum amplitude point in the signal.

Figure 8

Figure 7. Visualization of the group differences. The solid white line shows the difference between the means of the two GPs, whereas the curved dashed lines showcase two standard deviations from the mean. The horizontal dashed line has been added here only for highlighting zero-crossing points. Notably, although the $ \pm 2\sigma $ interval may not seem symmetric above and below the mean upon initial observation, it is, in fact, symmetric. Disparities between the HC and MS models are highlighted across the input space using the MMD.

Figure 9

Figure 8. Individual predictions versus held-out test data. The first four rows correspond to HC individuals, while the last four rows correspond to PwMS.

Figure 10

Figure 9. Individual differences, MMD highlighted. The first four rows correspond to HC individuals, while the last four rows correspond to PwMS.

Figure 11

Figure 10. Heteroscedastic variance differences at the individual layer in the hierarchy, where data from the left and right limbs are combined for each individual. Each line corresponds to a single subject.

Figure 12

Figure 11. Individual limb GP predictions. The first four rows correspond to HCs, while the last four rows correspond to PwMS. The solid lines represent the mean GP predictions, while the dotted lines correspond to the $ \pm 2\sigma $ interval. The dots represent the held-out test data.

Figure 13

Figure 12. (a) HC, (b) MS—Unified Wasserstein distance computed between samples drawn from the left GP and the right GP for each of the individuals in both HC and MS groups. (c) Statistical comparison between the HC and MS groups.

Figure 14

Figure 13. KL divergence computed between the individual level GP, combining the left and right limbs (third layer), and the individual limbs GP models, treating each limb individually (fourth layer). (a) HC, (b) MS. The lighter shade represents the left shank, while the darker shade represents the right shank. (c) statistical comparison.

Figure 15

Figure E1. Comparison of GP posterior distributions using KL divergence. (Left) KL divergence computed using the healthy control (HC) group GP posterior as the reference distribution. (Right) KL divergence computed using the MS group GP posterior as the reference distribution. The solid white line represents the mean difference between the two GPs, while the dashed lines indicate the $ \pm 2\sigma $ uncertainty bounds of the difference.

Figure 16

Figure E2. Comparison of individual differences using KL divergence. (Top) KL divergence computed using the healthy control (HC) group GP posterior as the reference distribution. (Bottom) KL divergence computed using individual-specific posteriors as reference distributions. The first four rows represent HC individuals, while the last four rows represent people with MS (PwMS). Higher values (yellow) indicate greater divergence between distributions.

Figure 17

Table F1. Performance metrics of individual models aggregating contralateral limb data

Figure 18

Figure G1. Wasserstein distance computed between left and right limb models.

Submit a response

Comments

No Comments have been published for this article.