Hostname: page-component-6766d58669-bkrcr Total loading time: 0 Render date: 2026-05-15T09:11:39.151Z Has data issue: false hasContentIssue false

An integrated framework for importance-performance analysis of product attributes and validation from online reviews and maintenance records

Published online by Cambridge University Press:  23 October 2024

Mengyuan Shen
Affiliation:
University of Michigan – Shanghai Jiao Tong University Joint Institute, Shanghai Jiao Tong University, Shanghai, China
Aoxiang Cheng
Affiliation:
University of Michigan – Shanghai Jiao Tong University Joint Institute, Shanghai Jiao Tong University, Shanghai, China
Youyi Bi*
Affiliation:
University of Michigan – Shanghai Jiao Tong University Joint Institute, Shanghai Jiao Tong University, Shanghai, China
*
Corresponding author Youyi Bi youyi.bi@sjtu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Importance-performance analysis (IPA) is widely used for needs analysis, product positioning, and strategic planning in product design. Previous research on IPA often employs single-source data such as customer surveys or online reviews with unavoidable subjective bias. In contrast, product maintenance records provide objective information on product quality and failure patterns, which can be cross-validated with customers’ personal experiences from surveys or online reviews. In this paper, we propose an integrated framework for conducting IPA from online reviews and product maintenance records jointly. An attribute-keyword dictionary is first established using keyword extraction and clustering methods. Then, semantic groups, including product attributes and associated descriptions, are extracted using dependency parsing analysis. The sentiment scores of identified product attributes are determined by a voting mechanism using two pre-trained sentiment analysis models. The importance of product attributes in IPA is estimated from the impact of sentiments of each product attribute on product ratings with the extreme gradient boosting (XGBoost) model, while the performance is estimated from the sentiment scores of online reviews or the quality statistics from product maintenance records. In addition, we propose two methods to validate the IPA results, in which the IPA results are compared with the actual product improvements on the market or compared with the analysis of customer reviews from different time periods, respectively. The validated IPA results from online reviews and maintenance records are then integrated to obtain a more comprehensive understanding of customer needs. A case study of passenger vehicles is used to demonstrate the framework. The proposed framework enables automatic data processing and can support companies in making efficient design decisions with more comprehensive perspectives from multisource data.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. An integrated framework for IPA of product attributes from online reviews and product maintenance records. The four main stages are denoted by colored boxes. Red arrows highlight the steps only applicable to online reviews.

Figure 1

Figure 2. The general process for semantic group generation. The description separation approach in the orange box will be explained in Section 3.2.3.

Figure 2

Table 1. Dependency relations identified from dependency parsing analysis

Figure 3

Figure 3. An example of dependency parsing (the original sentence is in Chinese). One sentence can include multiple dependency relations.

Figure 4

Figure 4. The overall workflow of description separation for a sentence with multiple identified attributes. The yellow blocks indicate the input and output of this process.

Figure 5

Table 2. Product attributes identified from online reviews and maintenance records

Figure 6

Table 3. Sample results of the sentiment analysis on customers’ reviews of SUV A in 2018. $ {S}_{ir} $ is the sentiment intensity of product attribute $ i $ ($ i=1,2,\dots, 16 $) in review $ r $

Figure 7

Table 4. Attribute performance for the 2018 model of SUV A. $ {S}_i $ is the average customer sentiment scores to attribute $ i $, and $ \overline{S_i} $ is the attribute performance

Figure 8

Table 5. Attribute importance and occurrence frequency for the 2018 model of SUV A

Figure 9

Figure 5. The importance-performance plot for SUV A in 2018 obtained from online reviews.

Figure 10

Figure 6. The occurrence frequency-performance plot for SUV A in 2018 obtained from online reviews.

Figure 11

Table 6. Attribute performance and importance for maintenance records of SUV A in 2018. $ \overline{P_i} $ is the attribute performance calculated from maintenance records

Figure 12

Figure 7. The importance-performance plot for SUV A in 2018 obtained from online reviews.

Figure 13

Figure 8. The importance-performance plot for SUV A in 2018 obtained from maintenance records.

Figure 14

Table 7. Improvement scores $ {p}_j $ for three vehicle attributes obtained from actual market data. For keyword $ w $ of attribute $ j $, $ {a}_{wj} $ is the weight, and $ {g}_{wj} $ is the flag of improvement (1 means the keyword-related function or component has been improved and 0 means there is no mentioned improvement)

Figure 15

Table 8. Changes in customer sentiment scores for SUV A from 2018 to 2022 based on online reviews. $ \Delta {S}_i $ is the customers’ average sentiment change of attribute $ i $ (see Equation (10))

Figure 16

Figure 9. The importance-performance plot for SUV A in 2018 obtained from online reviews with rankings of changes in customer sentiment scores for these attributes.

Figure 17

Table 9. Improvement scores for three vehicle attributes obtained from online reviews within two model years. $ {p}_{wj} $ is the improvement score of keyword $ w $ from attribute $ j $. $ {p}_j $ is the total improvement score of attribute $ j $

Figure 18

Figure 10. The importance-performance plot for SUV A in 2018 obtained from online reviews.

Figure 19

Figure 11. The importance-performance plot for SUV A in 2022 obtained from online reviews.

Figure 20

Table 10. Occurrence frequency proportion change for vehicle attributes obtained from maintenance records from 2018 to 2019. $ \Delta {P}_i $ is the occurrence frequency change for attribute $ i $ (refer to equations (10) and (13))

Figure 21

Figure 12. The importance-performance plot for SUV A in 2018 obtained from online reviews with rankings of changes of customer sentiment scores for these attributes.

Figure 22

Figure 13. The importance-performance plot obtained from maintenance records in 2018 with rankings of occurrence frequency proportion changes of attributes.

Figure 23

Figure 14. The combined importance-performance plots with $ \alpha =0.5 $ (upper) and $ \alpha =0.7 $ (lower).