Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-06T13:54:33.197Z Has data issue: false hasContentIssue false

Discrete Choice Data with Unobserved Heterogeneity: A Conditional Binary Quantile Model

Published online by Cambridge University Press:  29 August 2019

Xiao Lu*
Affiliation:
Department of Political Science, University of Mannheim, A5, 6, 68159, Mannheim, Germany. Email: xiao.lu@gess.uni-mannheim.de
Rights & Permissions [Opens in a new window]

Abstract

In political science, data with heterogeneous units are used in many studies, such as those involving legislative proposals in different policy areas, electoral choices by different types of voters, and government formation in varying party systems. To disentangle decision-making mechanisms by units, traditional discrete choice models focus exclusively on the conditional mean and ignore the heterogeneous effects within a population. This paper proposes a conditional binary quantile model that goes beyond this limitation to analyze discrete response data with varying alternative-specific features. This model offers an in-depth understanding of the relationship between the explanatory and response variables. Compared to conditional mean-based models, the conditional binary quantile model relies on weak distributional assumptions and is more robust to distributional misspecification. The model also relaxes the assumption of the independence of irrelevant alternatives, which is often violated in practice. The method is applied to a range of political studies to show the heterogeneous effects of explanatory variables across the conditional distribution. Substantive interpretations from counterfactual scenarios are used to illustrate how the conditional binary quantile model captures unobserved heterogeneity, which extant models fail to do. The results point to the risk of averaging out the heterogeneous effects across units by conditional mean-based models.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s) 2019. Published by Cambridge University Press on behalf of the Society for Political Methodology.
Figure 0

Figure 1. Comparison between the standard normal distribution, the standard logistic distribution, and the ALDs, with different quantile specifications $\unicode[STIX]{x1D70F}$ (scale fixed to one).

Figure 1

Figure 2. Comparison between the CBQ model (from 0.1 to 0.9 quantiles) and the ordinary logit model.

Figure 2

Table 1. Benchmarking performance of the ordinary logit and the CBQ model.

Figure 3

Figure 3. Coefficients and predicted probability of distance between the rapporteur and the EP median.

Figure 4

Table 2. Voter heterogeneity (calculated as the standard deviation of the nine quantile estimates for each covariate).

Figure 5

Figure 4. Predicted vote shares of the three candidates by varying Bush’s position (corresponding to Figure 1A in Alvarez and Nagler (1995)).

Figure 6

Figure 5. Comparing the CBQ, CL and MXL substitution patterns. The dashed lines are the 45-degree equal-division lines. Black points represent the pairs that are significantly different at a 95% confidence level while the gray points are insignificant. Q1 and Q9 indicate 0.1 and 0.9 quantile estimators, respectively. The average difference is calculated based on significantly different pairs.

Figure 7

Figure 6. Change in predicted probabilities when the largest party becomes the second largest.

Supplementary material: File

Lu et al. supplementary material

Online Appendix

Download Lu et al. supplementary material(File)
File 490.1 KB