Hostname: page-component-5db58dd55d-jnbmb Total loading time: 0 Render date: 2026-05-26T23:59:16.407Z Has data issue: false hasContentIssue false

Toward an ontological representation of fictional characters

Published online by Cambridge University Press:  20 February 2026

Antoine Bourgois*
Affiliation:
Lattice (CNRS & ENS-PSL & Université Sorbonne Nouvelle), Montrouge, 92120, France
Jean Barré
Affiliation:
Lattice (CNRS & ENS-PSL & Université Sorbonne Nouvelle), Montrouge, 92120, France
Olga Seminck
Affiliation:
Lattice (CNRS & ENS-PSL & Université Sorbonne Nouvelle), Montrouge, 92120, France
Thierry Poibeau
Affiliation:
Lattice (CNRS & ENS-PSL & Université Sorbonne Nouvelle), Montrouge, 92120, France
*
Corresponding author: Antoine Bourgois; Email: antoine.bourgois@protonmail.com
Rights & Permissions [Opens in a new window]

Abstract

Characters are central to narrative theory but remain under-specified in computational work, where they are often reduced to clusters of words or vectors. We propose an operationalizable ontology of characterization that bridges narratological theory and NLP. From BERT-based clustering of character descriptions, we derive 17 classes of attributes (actions, emotions, traits, relations, possessions, etc.), validated through manual annotation ($k = 0.77$) and automatic classification (64% accuracy vs. 12% baseline). Applied to character similarity tasks for French fiction, our framework outperforms existing models. By aligning narratological insights with computational methods, we move toward a representation of fictional characters as structured, comparable entities for large-scale literary analysis.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-ShareAlike licence (https://creativecommons.org/licenses/by-sa/4.0), which permits re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Table 1. Ontology of character attributes grouped into four macro-classes

Figure 1

Table 2. Accuracy results on the French adaptation of the Bamman, Underwood, and Smith (2014) similarity benchmark

Figure 2

Table 3. CharaSim-fr benchmark results summary

Figure 3

Table 4. Cosine similarities over fine-grained ontological classes

Figure 4

Table 5. Cosine similarities over fine-grained ontological classes

Figure 5

Figure A.1 Character attributes annotation interface.

Figure 6

Figure A.2 Accuracy by classes count for our exhaustive combination search. Results on the French adaptation of the Bamman et al. 2014; similarity benchmark.

Figure 7

Table C.1. CharaSim-fr benchmark detailed results

Submit a response

Rapid Responses

No Rapid Responses have been published for this article.