Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-06T11:57:42.620Z Has data issue: false hasContentIssue false

Explainable lexical entailment with semantic graphs

Published online by Cambridge University Press:  28 February 2022

Adam Kovacs
Affiliation:
Department of Automation and Applied Informatics, Budapest University of Technology and Economics, Budapest, Hungary TU Wien, Vienna, Austria
Kinga Gemes
Affiliation:
Department of Automation and Applied Informatics, Budapest University of Technology and Economics, Budapest, Hungary TU Wien, Vienna, Austria
Andras Kornai
Affiliation:
SZTAKI Institute of Computer Science, Budapest, Hungary
Gabor Recski*
Affiliation:
TU Wien, Vienna, Austria
*
*Corresponding author. E-mail: gabor.recski@tuwien.ac.at
Rights & Permissions [Opens in a new window]

Abstract

We present novel methods for detecting lexical entailment in a fully rule-based and explainable fashion, by automatic construction of semantic graphs, in any language for which a crowd-sourced dictionary with sufficient coverage and a dependency parser of sufficient accuracy are available. We experiment and evaluate on both the Semeval-2020 lexical entailment task (Glavaš et al. (2020). Proceedings of the Fourteenth Workshop on Semantic Evaluation, pp. 24–35) and the SherLIiC lexical inference dataset of typed predicates (Schmitt and Schütze (2019). Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 902–914). Combined with top-performing systems, our method achieves improvements over the previous state-of-the-art on both benchmarks. As a standalone system, it offers a fully interpretable model of lexical entailment that makes detailed error analysis possible, uncovering future directions for improving both the semantic parsing method and the inference process on semantic graphs. We release all components of our system as open source software.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Example entries of the monolingual binary portion of the Semeval lexical entailment dataset (Glavaš et al. 2020)

Figure 1

Table 2. Example entries of the SherLlic dataset (Schmitt and Schütze 2019). Argument labels indicate entity types: PER – person, LOC – location, ORGF – organization_founder, EMPL – employer, AUTH – book_author

Figure 2

Figure 1. Dependency parse of the definition of jewel.

Figure 3

Figure 2. 4lang definition graph of jewel.

Figure 4

Table 3. Official monolingual LE results on the ANY track (F-scores)

Figure 5

Figure 3. Example of an IRTG rule.

Figure 6

Figure 4. Graph representation of sample rule in Figure 3.

Figure 7

Figure 5. Obliques in UD and 4lang.

Figure 8

Figure 6. 4lang definition graph of John is supported by Bill and Bill is supporting John.

Figure 9

Figure 7. Relative clause modifier of a noun in UD and 4lang.

Figure 10

Table 4. Mapping from UD relations to 4lang subgraphs

Figure 11

Figure 8. 4lang representations of A is nation in B and A  is country in B.

Figure 12

Figure 9. Appending zero edges to premise.

Figure 13

Figure 10. Expanded 4lang representations of A is nation in B.

Figure 14

Figure 11. 4lang definition graphs of expanding and reducing A is heart of B.

Figure 15

Table 5. Performance on the Semeval development set. 4lang and 4lang_syn is our method without and with additional synonym nodes from WordNet and Wiktionary. WordNet is the baseline using WordNet hypernyms, 4lang_syn+WordNet is the union of 4lang+syn and WordNet

Figure 16

Table 6. Examples of entailment pairs not in WordNet but detected by our system

Figure 17

Table 7. Performance on the Semeval test set. 4lang and 4lang_syn is our method without and with additional synonym nodes from WordNet and Wiktionary. WordNet is the baseline using WordNet hypernyms, all is the union of 4lang+syn and WordNet. Previous top-scoring systems on each task are BMEAUT (Kovács et al. 2020) and SHIKEBLCU (Wang et al. 2020)

Figure 18

Table 8. Performance on the SherLlic test set. WordNet is the baseline using WordNet hypernyms, ESIM (Chen et al. 2017) is the strongest system evaluated that wasn’t tuned on SherLlic’s held-out portion and w2v+tsg_rel_emb is the overall strongest system of Schmitt and Schütze (2019). 4lang_high_prec and 4lang_high_fscore are the configurations of our system tuned for high precision and high F-score, respectively

Figure 19

Table 9. Comparing our system to RoBERTa. Examples from Schmitt and Schütze (2019)