Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-06T12:00:43.995Z Has data issue: false hasContentIssue false

Logistic Multidimensional Data Analysis for Ordinal Response Variables Using a Cumulative Link Function

Published online by Cambridge University Press:  27 March 2025

Mark de Rooij*
Affiliation:
Methodology and Statistics Department, Leiden University, Leiden, The Netherlands
Ligaya Breemer
Affiliation:
Methodology and Statistics Department, Leiden University, Leiden, The Netherlands
Dion Woestenburg
Affiliation:
Methodology and Statistics Department, Leiden University, Leiden, The Netherlands
Frank Busing
Affiliation:
Methodology and Statistics Department, Leiden University, Leiden, The Netherlands
*
Corresponding author: Mark de Rooij; Email: rooijm@fsw.leidenuniv.nl
Rights & Permissions [Opens in a new window]

Abstract

We present a multidimensional data analysis framework for the analysis of ordinal response variables. Underlying the ordinal variables, we assume a continuous latent variable, leading to cumulative logit models. The framework includes unsupervised methods, when no predictor variables are available, and supervised methods, when predictor variables are available. We distinguish between dominance variables and proximity variables, where dominance variables are analyzed using inner product models, whereas the proximity variables are analyzed using distance models. An expectation–majorization–minimization algorithm is derived for estimation of the parameters of the models. We illustrate our methodology with three empirical data sets highlighting the advantages of the proposed framework. A simulation study is conducted to evaluate the performance of the algorithm.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Psychometric Society
Figure 0

Figure 1 Probability density function for a continuous latent variable z with thresholds (indicated by the vertical lines) giving rise to an observed ordered categorical variable with categories, strongly disagree (SD), disagree (D), neutral (N), agree (A), and strongly agree (SA).

Figure 1

Figure 2 Biplot representations for CLPCA (left) and CLMDU (right) for a single response variable.Note: Variable markers for cumulative probabilities are added. Grey points represent observations. On the left, the green solid line represents the variable axis with markers indicating the estimated thresholds. The dotted lines indicate decision regions for the categories of the response variable. On the right, the green point represents the response variable. The circles represent decision boundaries, where outside the circle the first category of the label is preferred and inside the circle the second category of the label.

Figure 2

Figure 3 Estimated configuration for students data.Note: The dark green lines represent the response variables, the blue lines represent the predictor variables. Variable labels are placed on the positive side of the variables, that are the sides with the largest values. In the upper right corner the labels of ENGa and A1d overlap.

Figure 3

Table 1 Fit statistics for the dominance (CLRRR) and proximity (CLRMDU) analysis of pro-environmental behavior in one, two, and three dimensions

Figure 4

Figure 4 Biplot for the cumulative logistic restricted multidimensional unfolding solution relating environmental attitudes with pro-environmental behavior.

Figure 5

Figure 5 Estimated configuration for environmental efficacy data.

Figure 6

Figure 6 Estimated configuration for environmental efficacy data with decision regions for Item 4 (a) and Item 7 (b).

Figure 7

Figure 7 Simulation results for cumulative logistic reduced rank regression.Note: R denotes the number of response variables, C the number of response categories per response variable. The three columns show different distributions for the predictor variables (normal, uniform, Likert). On the horizontal axis we show the different sample sizes, while on the vertical axes, the value of recovery is found where lower values represent better recovery.

Figure 8

Figure 8 Simulation results for cumulative logistic restricted multidimensional unfolding.Note: R denotes the number of response variables, C the number of response categories per response variable. The three columns show different distributions for the predictor variables (normal, uniform, Likert). On the horizontal axis we show the different sample sizes, while on the vertical axes, the value of recovery is found where lower values represent better recovery.

Figure 9

Table 2 Results of simulation studies for dimension selection

Figure 10

Table A1 Fit statistics for the student data

Figure 11

Table A2 Implied coefficient from the CLRRR analysis for the student data

Figure 12

Table A3 Estimated coefficient from proportional odds models for the student data

Figure 13

Figure B1 Biplot for the cumulative logistic restricted multidimensinal unfolding solution relating environmental attitudes with pro-environmental behavior.

Figure 14

Figure B2 Biplot for the cumulative logistic reduced rank model relating environmental attitudes with pro-environmental behavior.