Hostname: page-component-77f85d65b8-zzw9c Total loading time: 0 Render date: 2026-03-30T10:02:19.060Z Has data issue: false hasContentIssue false

Biclustering Models for Two-Mode Ordinal Data

Published online by Cambridge University Press:  01 January 2025

Eleni Matechou*
Affiliation:
University of Kent
Ivy Liu
Affiliation:
Victoria University of Wellington
Daniel Fernández
Affiliation:
Victoria University of Wellington
Miguel Farias
Affiliation:
Coventry University
Bergljot Gjelsvik
Affiliation:
University of Oxford University of Oslo
*
Correspondence should be made to Eleni Matechou, School of Mathematics, Statistics and Actuarial Science, University of Kent, Cornwallis Building, Canterbury, CT2 7NF UK. Email: e.matechou@kent.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

The work in this paper introduces finite mixture models that can be used to simultaneously cluster the rows and columns of two-mode ordinal categorical response data, such as those resulting from Likert scale responses. We use the popular proportional odds parameterisation and propose models which provide insights into major patterns in the data. Model-fitting is performed using the EM algorithm, and a fuzzy allocation of rows and columns to corresponding clusters is obtained. The clustering ability of the models is evaluated in a simulation study and demonstrated using two real data sets.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
Copyright © 2016 The Author(s). This article is published with open access at Springerlink.com
Figure 0

Table 1. Model set with corresponding number of parameters ν.\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\nu .$$\end{document}

Figure 1

Table 2. Information criteria summary table.

Figure 2

Figure 1. Simulation study to assess the performance of model selection criteria in recovering the true number of clusters for our proposed biclustering finite mixture PO (POFM) model. Bars depict the percentage of cases when the true model is correctly identified by each criterion, averaged across the five scenarios.

Figure 3

Table 3. The average estimate obtained for each parameter over 100 simulations.

Figure 4

Table 4. The average Rand index for 100 simulated data sets based on our proposed (POFM) and double k-means (dkm) methods.

Figure 5

Table 5. The average Rand index based on our proposed (POFM) and double k-means (dkm) methods for 1000 simulated data sets.

Figure 6

Figure 2. Estimated probabilities of replying 3 or above to each of the 2 column clusters for all 3 row clusters, as derived by the biclustering model with R=3\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$R=3$$\end{document}, C=2\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$C=2$$\end{document}.

Figure 7

Table 6. Percent of individuals from the five POFM clusters, represented in the rows, that are clustered in the corresponding five double k-means (Vichi, 2001) clusters.

Supplementary material: File

Matechou et al. supplementary material

Matechou et al. supplementary material
Download Matechou et al. supplementary material(File)
File 108.6 KB