Hostname: page-component-89b8bd64d-9prln Total loading time: 0 Render date: 2026-05-08T02:28:36.588Z Has data issue: false hasContentIssue false

Does word knowledge account for the effect of world knowledge on pronoun interpretation?

Published online by Cambridge University Press:  12 February 2024

Cameron R. Jones*
Affiliation:
Department of Cognitive Science, UC San Diego, San Diego, CA, USA
Benjamin Bergen
Affiliation:
Department of Cognitive Science, UC San Diego, San Diego, CA, USA
*
Corresponding author: Cameron Jones; Email: cameron@ucsd.edu
Rights & Permissions [Opens in a new window]

Abstract

To what extent can statistical language knowledge account for the effects of world knowledge in language comprehension? We address this question by focusing on a core aspect of language understanding: pronoun resolution. While existing studies suggest that comprehenders use world knowledge to resolve pronouns, the distributional hypothesis and its operationalization in large language models (LLMs) provide an alternative account of how purely linguistic information could drive apparent world knowledge effects. We addressed these confounds in two experiments. In Experiment 1, we found a strong effect of world knowledge plausibility (measured using a norming study) on responses to comprehension questions that probed pronoun interpretation. In experiment 2, participants were slower to read continuations that contradicted world knowledge-consistent interpretations of a pronoun, implying that comprehenders deploy world knowledge spontaneously. Both effects persisted when controlling for the predictions of GPT-3, an LLM, suggesting that pronoun interpretation is at least partly driven by knowledge about the world and not the word. We propose two potential mechanisms by which knowledge-driven pronoun resolution occurs, based on validation- and expectation-driven discourse processes. The results suggest that while distributional information may capture some aspects of world knowledge, human comprehenders likely draw on other sources unavailable to LLMs.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Table 1. Experiment 1 example item versions and responses

Figure 1

Figure 1. We used norming studies to independently measure the linguistic and world knowledge bias toward each of the noun phrases (NP1 and NP2) in our stimuli. Linguistic bias (top, green) was unimodally skewed toward NP1, likely reflecting the effects of subjecthood and grammatical parallelism biases. World knowledge bias (bottom, blue) was bimodally and symmetrically distributed, indicating high agreement and reflecting the fact that reversing the order of each item effectively reverses its bias with respect to NP1/NP2-coding.

Figure 2

Figure 2. Left: Linguistic factors, such as grammatical role, had little influence on whether the second noun phrase (NP2) was selected as an antecedent ($ r=-0.129 $, $ {\chi}^2(1)=0.387 $, $ p=0.534 $). Centre: Distributional likelihood (operationalised as GPT-3 probability) was positively correlated with pronoun resolution decisions, and explained significant variance controlling for linguistic factors ($ r=0.482 $, $ {\chi}^2(1)=20.1 $, $ p<0.001 $). Right: The world knowledge plausibility of NP2 positively predicted whether it is selected as an antecedent, controlling for linguistic and distributional factors ($ r=0.714 $, $ {\chi}^2(1)=50.8 $, $ p<0.001 $).

Figure 3

Table 2. The full model predicting pronoun resolution decisions in experiment 1.

Figure 4

Figure 3. An example passage stimulus and attention check question from experiment 2. The order of the possible antecedents is counterbalanced across participants by swapping the positions of NP1 and NP2. Continuations are made to contradict one interpretation of the pronoun by asserting that one of the NPs (here, NP1) is in a state that is inconsistent with it having been the referent of the pronoun (CONT). Unambiguous, consistent control sentences are generated by replacing the referring expression (REF) with an explicit reference to the NP not mentioned in the continuation. See Table 3 for a full list of item permutations.

Figure 5

Table 3. Experiment 2 item versions

Figure 6

Table 4. Effects of world knowledge bias on log reading time across continuation regions

Figure 7

Figure 4. Mean reading time for each recorded region with 95% confidence intervals. Reading time in region 5 is 55 ms slower when the continuation is inconsistent with the more physically plausible interpretation of the ambiguous pronoun (top; $ {\chi}^2(1)=9.94 $, $ p=0.006 $). This difference is not seen when the critical sentence refers to objects unambiguously (bottom), indicating that the slowdown is due to inconsistency with the pronoun interpretation, rather than the continuation sentence itself taking longer to read.

Figure 8

Table 5. The full model predicting reading times in region 5 found a significant effect of world knowledge controlling for the effect of GPT-3 surprisal

Figure 9

Figure 5. Mean GPT-3 surprisal for each region with 95% confidence intervals. Surprisal appears larger in regions 3 and 4 for continuations that are inconsistent vs consistent with world knowledge. However, world knowledge bias continues to have a strong positive effect on reading times for region 5 when controlling for GPT-3 surprisal across regions 2–5 ($ {\chi}^2(1)=9.87 $, $ p=0.007 $), indicating that the influence of world knowledge on human comprehenders cannot be accounted for by distributional information.