Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-07T11:10:22.255Z Has data issue: false hasContentIssue false

Tracing thick and thin concepts through corpora

Published online by Cambridge University Press:  07 August 2023

Kevin Reuter*
Affiliation:
Department of Philosophy, University of Zurich, Zurich, Switzerland
Lucien Baumgartner
Affiliation:
Department of Philosophy, University of Zurich, Zurich, Switzerland
Pascale Willemsen
Affiliation:
Department of Philosophy, University of Zurich, Zurich, Switzerland
*
Corresponding author: Kevin Reuter; Email: kevin.reuter@uzh.ch
Rights & Permissions [Opens in a new window]

Abstract

Philosophers and linguists currently lack the means to reliably identify evaluative concepts and measure their evaluative intensity. Using a corpus-based approach, we present a new method to distinguish evaluatively thick and thin adjectives like ‘courageous’ and ‘awful’ from descriptive adjectives like ‘narrow,’ and from value-associated adjectives like ‘sunny.’ Our study suggests that the modifiers ‘truly’ and ‘really’ frequently highlight the evaluative dimension of thick and thin adjectives, allowing for them to be uniquely classified. Based on these results, we believe our operationalization may pave the way for a more quantitative approach to the study of thick and thin concepts.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. eval values, as well as ‘truly,’ ‘really,’ and ‘very’ uses per thousand (per mill) for all 45 adjectives using data from COCA, as well as the average values for each predefined category. For example, take the values for the adjective ‘courageous’: for every 1,000 uses of the term ‘courageous,’ we find that it is modified with ‘truly’ 2.74 times, with ‘really’ 4.22 times, and with ‘very’ 50.19 times.

Figure 1

Table 2. Ratios and eval values for six adjectives using data from Reddit.

Figure 2

Figure 1. Tree diagram displaying the clusters using hierarchical cluster analysis. We specified three clusters, two of which (left and right) feature almost only evaluative terms, and one cluster (middle) only contains nonevaluative adjectives. Terms in light grey are the selected nonevaluative adjectives, whereas the ones in black are evaluative adjectives.

Figure 3

Figure 2. Predicted probabilities for class membership. The first five charts are single plots for the 5 predefined categories, the sixth is a combined plot without confidence intervals. The x-axis indicates the $ eval $ number. The outliers ‘awful’ and ‘disgusting’ were not included.

Figure 4

Figure 3. Predicted probability for being an evaluative concept, with confidence intervals.

Figure 5

Table 3. Eval values (based on COCA) for disputed concepts and predicted probabilities for formerly unobserved adjectives based on the logistic regression model. If the predicted probability is >0.5, the adjectives would be considered evaluative.

Supplementary material: PDF

Reuter et al. supplementary material

Reuter et al. supplementary material

Download Reuter et al. supplementary material(PDF)
PDF 78.6 KB