Hostname: page-component-77f85d65b8-t6st2 Total loading time: 0 Render date: 2026-03-30T01:30:34.806Z Has data issue: false hasContentIssue false

Abstract meaning representation of Turkish

Published online by Cambridge University Press:  28 April 2022

Elif Oral
Affiliation:
NLP Research Group, Faculty of Computer and Informatics, Istanbul Technical University, Istanbul, Turkey
Ali Acar
Affiliation:
NLP Research Group, Faculty of Computer and Informatics, Istanbul Technical University, Istanbul, Turkey
Gülşen Eryiğit*
Affiliation:
NLP Research Group, Faculty of Computer and Informatics, Istanbul Technical University, Istanbul, Turkey Department of Artificial Intelligence and Data Engineering, Istanbul Technical University, Istanbul, Turkey
*
*Corresponding author. E-mail: gulsen.cebiroglu@itu.edu.tr
Rights & Permissions [Opens in a new window]

Abstract

Abstract meaning representation (AMR) is a graph-based sentence-level meaning representation that has become highly popular in recent years. AMR is a knowledge-based meaning representation heavily relying on frame semantics for linking predicate frames and entity knowledge bases such as DBpedia for linking named entity concepts. Although it is originally designed for English, its adaptation to non-English languages is possible by defining language-specific divergences and representations. This article introduces the first AMR representation framework for Turkish, which poses diverse challenges for AMR due to its typological differences compared to English; agglutinative, free constituent order, morphologically highly rich resulting in fewer word surface forms in sentences. The introduced solutions to these peculiarities are expected to guide the studies for other similar languages and speed up the construction of a cross-lingual universal AMR framework. Besides this main contribution, the article also presents the construction of the first AMR corpus of 700 sentences, the first AMR parser (i.e., a tree-to-graph rule-based AMR parser) used for semi-automatic annotation, and the evaluation of the introduced resources for Turkish.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. AMR interaction with knowledge bases and other NLP resources. (Dashed lines represent optional interactions.)

Figure 1

Figure 2. A sample AMR representation in graph and Penman notations.

Figure 2

Figure 3. AMR representations for nominal verbs produced with -lAn.

Figure 3

Figure 4. Nominalized verb samples.

Figure 4

Table 1. Some suffixes forming converbs and their corresponding AMR relations

Figure 5

Figure 5. Annotation of omitted pronouns in nominalized verbs.

Figure 6

Table 2. Modality samples

Figure 7

Figure 6. Modality representation in Turkish AMR.

Figure 8

Figure 7. AMR representation of the verb voice structures.

Figure 9

Figure 8. AMR representation of the -CA suffix.

Figure 10

Figure 9. AMR representation of emphatic reduplication and m-reduplication.

Figure 11

Figure 10. AMR representation of a copula marker occurring after a locative marker.

Figure 12

Figure 11. Alignment of the word yıllardır.

Figure 13

Table 3. A sentence “Bu ilişkiyi bitirelim, böyle yürütemeyeceğim, dedi.” (Let’s end this relationship, I can’t run it like this, she said) in the Turkish PropBank. The columns provide words’ position within the sentence, surface form, lemma, parts-of-speech tags, morphological features, head word index, dependency relation, and the PropBank tag, respectively. The annotation “Y” indicates that the following tag is a verb frame

Figure 14

Figure 12. Inter-step tree and AMR graph for “Bu ilişkiyi bitirelim, böyle yürütemeyeceğim, dedi.” (Let’s end this relationship, I can’t run it like this, she said).

Figure 15

Table 4. Actions

Figure 16

Table 5. Direct mapping of PropBank relations to AMR relations

Figure 17

Table 6. Annotation times

Figure 18

Table 7. Annotation agreements based on linguistic phenomena