Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-10T23:56:55.081Z Has data issue: false hasContentIssue false

Multilevel Analysis with Few Clusters: Improving Likelihood-Based Methods to Provide Unbiased Estimates and Accurate Inference

Published online by Cambridge University Press:  13 May 2020

Martin Elff*
Affiliation:
Department of Political and Social Sciences, Zeppelin University
Jan Paul Heisig
Affiliation:
Health and Social Inequality Research Group, WZB Berlin Social Science Center
Merlin Schaeffer
Affiliation:
Department of Sociology, University of Copenhagen
Susumu Shikano
Affiliation:
Department of Politics and Public Administration, University of Konstanz
*
*Corresponding author. E-mail: martin.elff@zu.de
Rights & Permissions [Opens in a new window]

Abstract

Quantitative comparative social scientists have long worried about the performance of multilevel models when the number of upper-level units is small. Adding to these concerns, an influential Monte Carlo study by Stegmueller (2013) suggests that standard maximum-likelihood (ML) methods yield biased point estimates and severely anti-conservative inference with few upper-level units. In this article, the authors seek to rectify this negative assessment. First, they show that ML estimators of coefficients are unbiased in linear multilevel models. The apparent bias in coefficient estimates found by Stegmueller can be attributed to Monte Carlo Error and a flaw in the design of his simulation study. Secondly, they demonstrate how inferential problems can be overcome by using restricted ML estimators for variance parameters and a t-distribution with appropriate degrees of freedom for statistical inference. Thus, accurate multilevel analysis is possible within the framework that most practitioners are familiar with, even if there are only a few upper-level units.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © Cambridge University Press 2020
Figure 0

Figure 1. Performance of ML point estimates of upper-level covariate effects in multilevel linear and probit modelsNote: The figure displays relative biases of ML point estimates (in percent of the true effect size). Vertical lines depict 95 per cent Monte Carlo confidence intervals for these results. The horizontal zero line denotes the reference of no bias. Black triangles replicate the results presented in the left column (‘Estimate’) of Figure 2 on page 754 in Stegmueller (2013). We additionally present two modifications of Stegmueller's analysis. The first (black circles) increases the number of replications from 1,000 to 10,000, leaving everything else the same. The second (grey squares) follows Stegmueller in using only 1,000 replications, but specifies different random number seeds for the different experimental conditions. The Monte Carlo confidence intervals are computed on the base of the standard deviation of the estimates across Monte Carlo replications divided by the square root of the Monte Carlo sample size (the number of simulation replications), and the 2.5 and 97.5 percentiles of the standard normal distribution (i.e. −1.96 and +1.96).

Figure 1

Figure 2. Performance of likelihood-based estimators of random intercept variances in multilevel linear and probit modelsNote: The figure displays relative bias (in percent of the true parameter size) in variance estimates for the random intercept. Vertical lines depict 95 per cent confidence intervals. The horizontal zero line denotes the reference of no bias. The Monte Carlo sample size is 5,000. The confidence intervals are constructed analogously to those in Figure 1.

Figure 2

Figure 3. Performance of likelihood-based confidence intervals for upper-level covariate effect in multilevel linear and probit modelsNote: The figure shows percentage point deviations of actual coverage rates from the nominal value of 95 per cent. The horizontal zero line denotes the reference of accurate coverage (i.e., actual equals nominal coverage rate) based on 5,000 Monte Carlo replications. The dashed horizontal lines indicate 95 per cent test intervals. For an accurate estimator of the 95 per cent confidence interval (i.e., one that has an actual coverage rate of 95 per cent), the estimated actual coverage rate should fall into the test interval 95 per cent of the time. In this sense, estimated coverage rates falling outside the test interval constitute statistically significant evidence against an accurate coverage rate. The test intervals are constructed to range from the 2.5 percentile to the 97.5 percentile (thus containing 95 per cent of the probability mass) of a binomial distribution with success probability p(1) = 0.95 and size parameter n = 5,000. This figure corresponds to the right-hand panel ‘CI non-coverage’ of Figure 2 on page 754 in Stegmueller (2013).

Figure 3

Figure 4. Performance of degrees of freedom approximations for the sampling distribution of test statistics in multilevel linear modelsNote: The figure shows percentage point deviations of actual coverage rates from the nominal value of 95 per cent. The horizontal zero line denotes the reference of accurate coverage (i.e., actual equals nominal coverage rate) based on 5,000 Monte Carlo replications. The dashed horizontal lines indicate 95 per cent test intervals that are constructed in the same way as in Figure 3. This figure has no correspondence in Stegmueller (2013).

Figure 4

Figure 5. Country-level determinants of support for the European UnionNote: The figure displays point estimates and their 95 per cent confidence or credible intervals. For (restricted) ML estimates, thick lines represent 95 per cent confidence intervals based on the normal distribution, and thin antlers represent intervals based on the t-distribution with m − l − 1 degrees of freedom. Sample size: 10,777 individuals, 14 countries. This figure corresponds to Figure 8 on page 758 in Stegmueller (2013).

Supplementary material: Link

Elff et al. Dataset

Link
Supplementary material: PDF

Elff et al. supplementary material

Online Appendix

Download Elff et al. supplementary material(PDF)
PDF 392.8 KB