Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-06T03:30:30.158Z Has data issue: false hasContentIssue false

Sali-CAT: A new method for ranking social salience for multiple variables

Published online by Cambridge University Press:  03 March 2026

Xia Hua
Affiliation:
Mathematical Sciences Institute, Australian National University, Canberra, ACT, Australia
Jesse Stewart
Affiliation:
Department of Linguistics, University of Saskatchewan, Saskatoon, SK, Canada
Lindell Bromham
Affiliation:
Research School of Biology, Australian National University, Canberra, ACT, Australia
Cassandra Algy
Affiliation:
Karungkarni Art and Culture Aboriginal Corporation, Kalkaringi, NT, Australia
Felicity Meakins*
Affiliation:
School of Languages and Cultures, University of Queensland, Brisbane, QLD, Australia
*
Corresponding author: Felicity Meakins; Email: f.meakins@uq.edu.au
Rights & Permissions [Opens in a new window]

Abstract

Social salience, the association of a social category with linguistic variables, has been hypothesized to be an important driver of language change. This hypothesis has not been rigorously tested due to the lack of a reliable measure of social salience. In this paper, we present Salience Categorization Test (Sali-CAT), a new approach to measuring the association of word variants with social categories across multiple lexical variables. The approach includes a customized experimental paradigm (three alternative forced choice) and a statistical method to establish the baseline Salience Ratio (Sali-RAT) score for word variants that do not have a bias in usage with respect to the social categories. We demonstrate the approach by testing the association of multiple variables with different generations of speakers in the Gurindji speech community.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2026. Published by Cambridge University Press.
Figure 0

Figure 1. (A) Image of ‘lightning’ presented with the audio of either laitning or janginyina (Kriol and Gurindji for ‘lightning,’ respectively) followed by (B) response options (left B) karu ‘youth,’ (center B) Everybody ‘everyone,’ and (right B) Pulka ‘elder.’

Figure 1

Table 1. Experimental stimuli (Gurindji-origin words bolded)

Figure 2

Table 2. Participants by generation and gender

Figure 3

Figure 2. Percentage of participant responses (y-axis) to five control words which have no variants (x-axis).

Figure 4

Figure 3. Results from our full multinomial logistic regression model based on response patterns (axes plotted on log scale). A Sali-RAT score that is greater than 1.0 along the “elder” axis suggests stronger positive association between the word variant and “elder,” while a Sali-RAT score that is lower than 1.0 along the “elder” axis suggests stronger negative association between the word variant and “elder.” The same applies to the “youth” axis and association between the word variant and “youth.” Confidence intervals along the “elder” or “youth” axis are plotted as a bar for a word variant if the “elder” or “youth” coefficient of the word variant is more significant than any of the five control words. In general, variants with significant coefficients have confidence intervals that do not go across 1.0 along the axis corresponding to the coefficient. The only situation where significance may not agree with confidence intervals is when participants answer “everyone” significantly more or less often than the control words. In this situation, confidence intervals not going across 1.0 allow us to identify salient word variants.

Figure 5

Table 3. Word variants that are significantly positively associated with “elder.” The position of a word variant along the “elder” axis in Figure 4 represents the number of times as many “elder” responses as received by control words, calculated from our full regression model as the ratio between the estimated probability of “elder” answers for the word variant and the estimated probability of “elder” answers for the control words (Sali-RAT score). Significance level is determined by the minimum p-values of coefficients for the five control words in the full model

Figure 6

Figure 4. Participant responses to faya ‘fire’ (left) and warlu ‘fire’ (right).

Figure 7

Table 4. Word variants that have significant positive and negative associations with the “youth.” Number of times as many “youth” responses as control words is the position of a word variant along the “youth” axis in Figure 4, calculated from our full regression model as the ratio between the estimated probability of “youth” answers for the word variant and the estimated probability of “youth” answers for the control words (Sali-RAT score). Significance level is determined by the minimum p-values of coefficients for the five control words in the full model

Figure 8

Table 5. Word variants which show no significant association with “youth” or “elder.” ‘Mouth’ is the only variable of which no word variants are salient

Figure 9

Figure 5. Participant bias estimated from our full multinomial logistic regression model. A larger positive value along the “elder” axis suggests that the participant answers “elder” more often than the control participant, while a negative value along the “elder” axis suggests the participant answers “elder” less often than the control participant. The same applies to the “youth” axis and how often participants answer “youth.” Participants who are located closer to each other have more similar responses. Both axes are square-rooted while keeping the sign in order to spread out data points. Four participants were located far from the center cluster, so we reran all our analyses, removing these four participants.

Figure 10

Table 6. Effects of gender and age on a participant’s response to the control words. Since gender and origin are categorical variables, an age effect (including intercept and slope) is estimated for each of four partitions of the data: female responses to Kriol words; female responses to Gurindji words; male responses to Kriol words; and male responses to Gurindji words. None of the four slopes is significantly different from zero, suggesting no age effect on participants’ responses to control words. These slopes also do not depend on gender or word origin. The intercepts use female responses to Kriol-origin words as the reference level, so, for example, the intercept for female responses to Gurindji-origin words shows how female responses to Gurindji-origin words differ from those to Kriol-origin words. None of the three intercepts is significantly different from zero, suggesting no gender effect on participants’ responses to control words that have either Gurindji or Kriol origin

Figure 11

Figure 6. Proportion of “elder,” “everyone,” and “youth” answers by participants of specific generations (G1 = Generation 1, G2 = Generation 2, G3 = Generation 3, G4 = Generation 4) and genders to word variants with Kriol origin and Gurindji origin.

Figure 12

Table 7. Effects of gender and age on a participant’s response to the 75 word variants. Similar to Table 6, an age effect (including intercept and slope) is estimated for each of four partitions of the data: female responses to Kriol-origin words; female responses to Gurindji-origin words; male responses to Kriol-origin words; and male responses to Gurindji-origin words. Since gender and origin are categorical variables, the intercepts use female responses to Kriol-origin variants as the reference level, so, for example, the intercept for Gurindji, female compares female responses to Gurindji-origin variants to females’ responses to Kriol-origin variants

Supplementary material: File

Hua et al. supplementary material

Hua et al. supplementary material
Download Hua et al. supplementary material(File)
File 2 MB