Multivariate statistical analysis of data from the PERMA+4 questionnaire “Subjective well-being of an organization’s employees”: Application of dimension reduction and clustering methods

26 August 2025, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

The development of machine learning methods opens new opportunities for analyzing multidimensional psychological constructs that have traditionally been studied with classical statistical approaches. This study presents a comprehensive multivariate statistical analysis of data on employees' subjective well-being obtained via the PERMA+4 questionnaire. Contemporary methods of dimensionality reduction (PCA, t-SNE, UMAP, Isomap, MDS) and clustering (K-means, DBSCAN, agglomerative clustering) were used to reveal the latent structure of the well-being data. The quality of the solutions was assessed via a set of validated metrics: the silhouette score, the Calinski–Harabasz score, and the Davies–Bouldin score. The sample consisted of 325 respondents, measured on the nine employee well-being indicators included in the PERMA+4 model. UMAP in combination with K-means clustering proved exceptionally effective (silhouette coefficient = 0.942). A stable two-cluster data structure was identified, reflecting a qualitative difference between groups of employees with moderate (78%) and high (22%) levels of well-being. All measures used showed statistically significant differences between clusters (p < 0.001, effect sizes r = 0.405–0.672). Correlation analysis of the UMAP space revealed the dominance of a general well-being factor (first axis), with economic security playing a specific role as a partially independent measure (second axis). The results make a significant contribution to understanding how the components of subjective employee well-being interact, confirm its systemic nature, and provide empirical grounds for developing differentiated strategies to improve well-being. They also demonstrate the high applicability of nonlinear dimension reduction methods for analyzing the structure of psychometric data.
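Since the headline result is a silhouette coefficient, it may help to see how that metric is defined. The sketch below is not the authors' code and does not use the study's data: the function name `silhouette_score`, the toy 2-D coordinates (standing in for points in an embedding space), and the labels are all illustrative assumptions. For each point it compares the mean distance to its own cluster (a) with the smallest mean distance to any other cluster (b), scores the point as (b - a) / max(a, b), and averages over all points.

```python
from math import dist
from statistics import mean


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points, using Euclidean distance."""
    # Group the points by cluster label.
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("the silhouette coefficient needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the distance from p to itself is 0, so dividing by n - 1 is correct).
        a = sum(dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            mean(dist(p, q) for q in other)
            for key, other in clusters.items()
            if key != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return mean(scores)


# Hypothetical toy data: two compact, well-separated groups in a 2-D space.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
good_labels = [0, 0, 0, 0, 1, 1, 1, 1]  # matches the visible grouping
bad_labels = [0, 1, 0, 1, 0, 1, 0, 1]   # ignores the grouping

good = silhouette_score(points, good_labels)
bad = silhouette_score(points, bad_labels)
print(f"good partition: {good:.3f}, bad partition: {bad:.3f}")
```

Values close to 1 indicate compact clusters that lie far apart, which is how a coefficient such as the reported 0.942 is read. Values near 0 or below indicate overlapping or misassigned clusters.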

Keywords

subjective well-being
PERMA+4
dimension reduction
clustering
UMAP
multivariate statistical analysis
clustering quality metrics

Supplementary materials

Title: PERMA+4 Data
Description: Data collected via PERMA+4 questionnaire
