Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-06T18:57:16.138Z Has data issue: false hasContentIssue false

Grammatical unidirectionality is not reflected in individual preferences when performing artificial semantic extension

Published online by Cambridge University Press:  01 August 2025

Anna Kapron-King*
Affiliation:
School of Informatics, University of Edinburgh, Edinburgh, UK Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
Simon Kirby
Affiliation:
Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
Graeme Trousdale
Affiliation:
School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
Kenny Smith
Affiliation:
Centre for Language Evolution, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
*
Corresponding author: Anna Kapron-King; Email: a.kapron-king@ed.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Grammaticalization is the process whereby lexical items change into grammatical items. This phenomenon is widely attested, while the change from grammatical to lexical is far less common. We ran two experiments to test whether this unidirectional tendency originates with a preference for extending lexical meanings to grammatical ones rather than vice versa. We focus on body parts and spatial relations. In Experiment 1, participants were told the meaning of an artificial word then rated how likely it is that that word can also be used to refer to a second meaning – one meaning was a body part and one a preposition. We expected higher ratings when extending from body parts to prepositions than vice versa but found no difference. In Experiment 2, participants performed semantic extension in communication. We varied whether they extended words for body parts to prepositions or vice versa. Again, we found no asymmetry. Finally, we used a model of Experiment 2 to show that asymmetrical extension follows straightforwardly if there is an asymmetry in the number of words available relative to the number of meanings to express, indicating that having a larger number of lexical items than grammatical concepts could be an alternative source of unidirectionality.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Observed instances of grammaticalization from body part to spatial concept (Kuteva et al., 2019)

Figure 1

Figure 1. Example of a space-to-body trial from Experiment 1. The participant saw the first screen, then gave their rating on the second screen.

Figure 2

Table 2. The pairs of meanings used as stimuli in Experiment 1, derived from Heine and Kuteva (2002)

Figure 3

Figure 2. Experiment 1 ratings. Each dot represents a single response from a single participant. Response values are on a scale from very unlikely to very likely. The responses are split by condition: body-to-space, where the first meaning was a body part and the second was a preposition, and space-to-body, where the order was reversed. The violin plots show the density of responses along the y-axis. The box plots indicate the 25th percentile (lower hinge) and 75th percentile (upper hinge) with a dark line indicating the median. The whiskers extend $ 1.5\ast \mathtt{IQR} $ from the hinges. Contrary to our prediction, there was no difference in participants’ responses between the two conditions.

Figure 4

Figure 3. Experiment 1 ratings by body part–preposition pair, showing no clear preference for the predicted direction of semantic change. Plotting conventions as in Figure 2.

Figure 5

Table 3. Formulas for the mixed-effects beta-regression models fit to the Experiment 1 data

Figure 6

Table 4. Lists of body parts and spatial prepositions used as stimuli in Experiment 2

Figure 7

Figure 4. The three kinds of training used in the first phase of Experiment 2. Upper panel: an observation trial, the participant passively observes the word plus associated meaning for 3 seconds. Middle panel: comprehension trial, the participant selects the meaning of a word and receives feedback. Lower panel: production trial, the participant selects the word for a given meaning and receives feedback.

Figure 8

Figure 5. The two roles in the communication phase of Experiment 2. Upper panel: the sender’s view, the participant chooses a word to refer to the highlighted meaning. Lower panel: the receiver’s view, the participant sees a word and selects the intended meaning.

Figure 9

Figure 6. Communicative success in Experiment 2. Each dot represents a single participant and indicates the proportion of their responses as the receiver that were correct. The dotted line indicates chance performance, i.e., random selection from among 6 possible meanings. The ‘seen targets’ facet shows results for trials where the target was one of the six meanings the participant encountered during training: body parts for the body-to-space participants and prepositions for the space-to-body participants. The ‘unseen targets’ facet shows results for trials where the target was from the opposite category from what the participant saw in training. As expected, receivers are highly successful for seen targets. However, unexpectedly, success is lower when the seen target is a preposition, suggesting these may be harder to learn. We expected receivers to have higher correctness for unseen targets in the body-to-space than the space-to-body condition, but found no significant difference.

Figure 10

Figure 7. Proportion of predicted responses in the communication phase of Experiment 2. The predicted response for a seen target is simply the word the participant learned for that target in the learning phase; for unseen targets, the predicted response is the paired concept derived from the World Lexicon of Grammaticalization, see main text. We again found no significant difference between the conditions.

Figure 11

Figure 8. The entropy of sender responses for each target meaning, split by whether the target was seen or unseen. Each dot represents the entropy for one target meaning. Unseen targets had higher entropy than seen targets, and seen prepositions (space-to-body) had higher entropy than seen body parts (body-to-space).

Figure 12

Figure 9. Heatmaps showing response probabilities in unseen trials for both senders and receivers, with colour indicating condition. The sender heatmaps have target meanings on the y-axis and sender responses on the x-axis (indicated by the meaning the participant originally learned for the word they responded with). The receiver heatmaps have the meaning of the word the sender said on the y-axis, and the receiver’s response on the x-axis. The values represent the probability of responding with meaning $ {m}_1 $ given the observed meaning $ {m}_0 $, so each row in each subplot sums to 1.

Figure 13

Table 5. Parameters and functions used in the model of the sender–receiver task from Experiment 2

Figure 14

Figure 10. Proportion of trials where the model’s receiver produced the correct response. Responses were generated 100 times for each target meaning, each list, each direction (body-to-space and space-to-body), and each manipulation. The error bar shows the 95% confidence interval. The dotted line shows chance performance, which varies depending on the number of meanings the receiver selects from. Receiver success was higher when agents had more words and fewer meanings to refer to than when they had fewer words and more meanings. Direction had no effect on success.

Supplementary material: File

Kapron-King et al. supplementary material

Kapron-King et al. supplementary material
Download Kapron-King et al. supplementary material(File)
File 316.5 KB