Hostname: page-component-5db58dd55d-jnbmb Total loading time: 0 Render date: 2026-05-28T11:37:00.713Z Has data issue: false hasContentIssue false

Graph-based methods for discrete choice

Published online by Cambridge University Press:  06 November 2023

Kiran Tomlinson*
Affiliation:
Cornell University, Ithaca, NY, USA
Austin R. Benson
Affiliation:
Cornell University, Ithaca, NY, USA
*
Corresponding author: Kiran Tomlinson; Email: kt@cs.cornell.edu
Rights & Permissions [Opens in a new window]

Abstract

Choices made by individuals have widespread impacts—for instance, people choose between political candidates to vote for, between social media posts to share, and between brands to purchase—moreover, data on these choices are increasingly abundant. Discrete choice models are a key tool for learning individual preferences from such data. Additionally, social factors like conformity and contagion influence individual choice. Traditional methods for incorporating these factors into choice models do not account for the entire social network and require hand-crafted features. To overcome these limitations, we use graph learning to study choice in networked contexts. We identify three ways in which graph learning techniques can be used for discrete choice: learning chooser representations, regularizing choice model parameters, and directly constructing predictions from a network. We design methods in each category and test them on real-world choice datasets, including county-level 2016 US election results and Android app installation and usage data. We show that incorporating social network structure can improve the predictions of the standard econometric choice model, the multinomial logit. We provide evidence that app installations are influenced by social context, but we find no such effect on app usage among the same participants, which instead is habit-driven. In the election data, we highlight the additional insights a discrete choice framework provides over classification or regression, the typical approaches. On synthetic data, we demonstrate the sample complexity benefit of using social information in choice models.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Dataset summary. $|A|$: number of choosers (aggregated at the county/precinct for elections), $|U|$: number of items, $|C|$: choice set sizes, $N$: number of observed choices, $d_x$: number of chooser features

Figure 1

Figure 1. Difference between actual and expected install rates (if friendships were irrelevant). The left subplot is with the real network, while the right two are two null models. Each dot is a (participant, app) pair. The black line marks the mean over five bins, with the shaded region showing the standard error of the mean.

Figure 2

Figure 2. 2016 US presidential election vote shares for conservative independent Evan McMullin. Notice his regional popularity and the spillover from Utah to southeast Idaho. McMullin was not on the ballot in filled-in states. The lack of spillover into Colorado may be due to its crowded field (22 candidates) or because it is less conservative than Idaho.

Figure 3

Figure 3. Estimation error of item utilities with (left) and without (right) Laplacian regularization on synthetic data generated according to the priors in Section 4.1.1, with varying homophily strength $\lambda$. Error bars (most are tiny) show standard error over 8 trials. Using Laplacian regularization can improve sample complexity by orders of magnitude.

Figure 4

Figure 4. Test negative log likelihoods (NLL; top row; lower is better) and mean relative ranks (MRR; bottom row; lower is better) on the two Friends and Family datasets and three election datasets (error bars show standard error over chooser sampling). “Logit” signifies plain logit in app-install, CL in app-usage, and MNL in the election datasets. Laplacian regularization improves performance in app-install, while no method improves on CL in app-usage. In the election data, Laplacian MNL, but not GCN, outperforms MNL across train fractions. Propagation performs well on app-install, but very poorly on app-usage, as it does not utilize recency. Despite not using county/precinct features, propagation can be competitive in the election data.

Figure 5

Table 2. Runtime in seconds to train and test each model, with standard err over 4 trials

Figure 6

Table 3. Edge densities within/between the groups preferring Facebook ($|F| = 70$) and Myspace ($|M| = 27$) in app-install. Left: including the 3 choosers in $F\cap M$. Right: excluding $F\cap M$

Figure 7

Table 4. Maximum likelihood 2016 election outcomes under our model under the three scenarios in Section 6.4. We show mean vote shares (with 95% confidence interval over trials) for the top three predicted candidates and differences in state outcomes between the counterfactual prediction and reality. C: Clinton, T: Trump, Outcome: Electoral College votes. T $\rightarrow$ C denotes that a state won by Trump goes for Clinton under the model. States abbreviated by postal code