Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-06T14:49:41.555Z Has data issue: false hasContentIssue false

A database of ambiguous Chinese characters with measures for meaning dominance and meaning balance

Published online by Cambridge University Press:  23 September 2024

Huilin Chen
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
Xu Xu*
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
Haiquan Li
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
Xinyue Yu
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
Ruting Pan
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
Zhaoyang Zhang
Affiliation:
Shanghai Jiao Tong University, Shanghai, China
*
Corresponding author: Xu Xu; Email: xu2xu3@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Chinese characters hold great potential to help inform and enrich psycholinguistic research on lexical ambiguity as a large portion of them are ambiguous in nature with meaning varying from context to context. This report presents a psycholinguistic database that contains over 2000 characters with normative measures for meaning dominance and meaning balance, that is, the relative frequency of each meaning associated with a target character and the degree of balance across the meanings of the character. The measurement process takes advantage of the fact that, in Chinese, generating words containing a target character is the most convenient way to specify and disambiguate character meanings. Character meanings stored in ordinary people’s mental lexicon are identified based on the words, along with a small portion of meaning descriptions, listed by over 900 native speakers. The measures of meaning dominance and meaning balance for the characters are derived from computing the relative frequencies of the meanings. Potential research and practical applications of the database, as a valuable tool, to enhance our understanding of the acquisition, representation, and processing of ambiguous lexical items are discussed.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Gender, age, education level, and region distributions of the participants.

Figure 1

Table 1. Coded responses, along with their raw frequencies, for the character 北

Figure 2

Figure 2. The distribution of generated number of meanings (gNoM).

Figure 3

Table 2. Measures of meaning balance (beta and D) and relative frequencies for the most and the second most dominant meanings (separate and combined) of the characters grouped by gNoM

Figure 4

Figure 3. The distributions of beta and the D for characters with gNoM > 1.

Figure 5

Table 3. Correlations between beta, D, and other variables