Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-08T02:21:55.892Z Has data issue: false hasContentIssue false

A network-based discrete choice model for decision-based design

Published online by Cambridge University Press:  24 March 2023

Zhenghui Sha*
Affiliation:
Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, TX, USA
Yaxin Cui
Affiliation:
Department of Mechanical Engineering, Northwestern University, Evanston, IL, USA
Yinshuang Xiao
Affiliation:
Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, TX, USA
Amanda Stathopoulos
Affiliation:
Department of Civil and Environmental Engineering, Northwestern University, Evanston, IL, USA
Noshir Contractor
Affiliation:
Department of Industrial Engineering & Management Sciences, Northwestern University, Evanston, IL, USA
Yan Fu
Affiliation:
Global Data Insight and Analytics, Ford Motor Company, Dearborn, MI, USA
Wei Chen
Affiliation:
Department of Mechanical Engineering, Northwestern University, Evanston, IL, USA
*
Corresponding author: Z. Sha zsha@austin.utexas.edu
Rights & Permissions [Opens in a new window]

Abstract

Customer preference modelling has been widely used to aid engineering design decisions on the selection and configuration of design attributes. Recently, network analysis approaches, such as the exponential random graph model (ERGM), have been increasingly used in this field. While the ERGM-based approach has the new capability of modelling the effects of interactions and interdependencies (e.g., social relationships among customers) on customers’ decisions via network structures (e.g., using triangles to model peer influence), existing research can only model customers’ consideration decisions, and it cannot predict individual customer’s choices, as what the traditional utility-based discrete choice models (DCMs) do. However, the ability to make choice predictions is essential to predicting market demand, which forms the basis of decision-based design (DBD). This paper fills this gap by developing a novel ERGM-based approach for choice prediction. This is the first time that a network-based model can explicitly compute the probability of an alternative being chosen from a choice set. Using a large-scale customer-revealed choice database, this research studies the customer preferences estimated from the ERGM-based choice models with and without network structures and evaluates their predictive performance of market demand, benchmarking the multinomial logit (MNL) model, a traditional DCM. The results show that the proposed ERGM-based choice modelling achieves higher accuracy in predicting both individual choice behaviours and market share ranking than the MNL model, which is mathematically equivalent to ERGM when no network structures are included. The insights obtained from this study further extend the DBD framework by allowing explicit modelling of interactions among entities (i.e., customers and products) using network representations.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The customer–product bipartite choice network.

Figure 1

Table 1. Networks with different complexities

Figure 2

Table 2. Three major categories of network statistics in exponential random graph models in a customer–product network

Figure 3

Figure 2. Overview of the exponential random graph model-based choice modelling research approach.

Figure 4

Figure 3. Illustration of the choice situation where a customer makes a choice among three alternatives. A customer $ n $ is allowed to make a purchase decision among the three car models, car $ i $, car $ j $ and car $ k $. The figure illustrates the three choice scenarios.

Figure 5

Table 3. The scheme of comparison

Figure 6

Table 4. Descriptive statistics of the 281 sedan car attributes

Figure 7

Table 5. Estimated results of ChoiceSet6 test case with the comparison of different exponential random graph model base choices

Figure 8

Table 6. Estimated results of ChoiceSetAll test case with the comparison of different exponential random graph model base choices

Figure 9

Figure 4. Top-N predictions results for three different test datasets (with mean and error bar) by different models in ChoiceSet6 test case.

Figure 10

Figure 5. Rank correlation comparison between different models and the baseline value from the original compact sedan buyer dataset.

Figure 11

Table 7. Root mean square errors (RMSEs) between the market share of the original compact sedan buyer dataset and the predicted market share of exponential random graph model base choices

Figure 12

Figure 6. Top-N predictions results for three different test datasets (with mean and error bar) by different models in ChoiceSetAll test case.

Figure 13

Figure 7. Decision-based design framework enhanced by network models. The grey boxes are the entities and events from the referred decision-based design framework; the coloured boxes belong to the proposed ERGM-based choice analysis.

Figure 14

Table 8. Estimated results of ChoiceSet6 test case with the comparison between $ {ERGM}_{Both} $ and DCM

Figure 15

Figure 8. Top-N predictions for DCM and $ {ERGM}_{Both} $ model in ChoiceSet6 test case.

Figure 16

Figure 9. The interdependency assumption underlying the network-based models.

Figure 17

Figure A1. Market share comparison of the original compact sedan purchased dataset and the training dataset.