Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-11T13:44:18.392Z Has data issue: false hasContentIssue false

Partial Identification of Latent Correlations with Ordinal Data

Published online by Cambridge University Press:  01 January 2025

Jonas Moss
Affiliation:
BI Norwegian Business School
Steffen Grønneberg*
Affiliation:
BI Norwegian Business School
*
Correspondence should be made to Steffen Grønneberg, Department of Economics, BI Norwegian Business School, 0484 Oslo, Norway. Email: steffeng@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

The polychoric correlation is a popular measure of association for ordinal data. It estimates a latent correlation, i.e., the correlation of a latent vector. This vector is assumed to be bivariate normal, an assumption that cannot always be justified. When bivariate normality does not hold, the polychoric correlation will not necessarily approximate the true latent correlation, even when the observed variables have many categories. We calculate the sets of possible values of the latent correlation when latent bivariate normality is not necessarily true, but at least the latent marginals are known. The resulting sets are called partial identification sets, and are shown to shrink to the true latent correlation as the number of categories increase. Moreover, we investigate partial identification under the additional assumption that the latent copula is symmetric, and calculate the partial identification set when one variable is ordinal and another is continuous. We show that little can be said about latent correlations, unless we have impractically many categories or we know a great deal about the distribution of the latent vector. An open-source R package is available for applying our results.

Information

Type
Theory and Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Copyright
Copyright © 2022 The Author(s) under exclusive licence to The Psychometric Society
Figure 0

Figure 1. Upper and lower limits for ρΠ(F)\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$\rho _{\varvec{\Pi }}(\mathcal {F})$$\end{document} when the marginals are fixed. The dashed line is the polychoric correlation, corresponding to normal marginals and the normal copula.

Figure 1

Figure 2. Illustration of Theorem 2 and Corollary 2. The black lines are the limits of the identification sets in Corollary 2, the black dashed lines are the limits of identification sets in Theorem 2, and the gray dashed line is the true polychoric correlation.

Figure 2

Figure 3. Polychoric correlation estimates for 25 items from the International Personality Item Pool (Goldberg, 1999).

Figure 3

Figure 4. Upper (blue) and lower (red) correlation bounds for the items in the International Personality Item Pool (Goldberg, 1999).

Supplementary material: File

Moss and Grønneberg supplementary material

Moss and Grønneberg supplementary material
Download Moss and Grønneberg supplementary material(File)
File 1.1 MB