Hostname: page-component-77f85d65b8-6bnxx Total loading time: 0 Render date: 2026-03-28T17:53:55.348Z Has data issue: false hasContentIssue false

Determining sentiment views of verbal multiword expressions using linguistic features

Published online by Cambridge University Press:  15 May 2023

Michael Wiegand*
Affiliation:
Digital Age Research Center, Alpen-Adria-Universität Klagenfurt, Klagenfurt am Wörthersee, Austria Sprach- & Signalverarbeitung, Universität des Saarlandes, Saarbrücken, Germany
Marc Schulder
Affiliation:
Sprach- & Signalverarbeitung, Universität des Saarlandes, Saarbrücken, Germany Institut für Deutsche Gebärdensprache, Hamburg, Germany
Josef Ruppenhofer
Affiliation:
Leibniz-Institut für Deutsche Sprache, Mannheim, Germany
*
Corresponding author: M. Wiegand; Email: michael.wiegand@aau.at
Rights & Permissions [Opens in a new window]

Abstract

We examine the binary classification of sentiment views for verbal multiword expressions (MWEs). Sentiment views denote the perspective of the holder of some opinion. We distinguish between MWEs conveying the view of the speaker of the utterance (e.g., in “The company reinvented the wheel the holder is the implicit speaker who criticizes the company for creating something already existing) and MWEs conveying the view of explicit entities participating in an opinion event (e.g., in “Peter threw in the towel the holder is Peter having given up something). The task has so far been examined on unigram opinion words. Since many features found effective for unigrams are not usable for MWEs, we propose novel ones taking into account the internal structure of MWEs, a unigram sentiment-view lexicon and various information from Wiktionary. We also examine distributional methods and show that the corpus on which a representation is induced has a notable impact on the classification. We perform an extrinsic evaluation in the task of opinion holder extraction and show that the learnt knowledge also improves a state-of-the-art classifier trained on BERT. Sentiment-view classification is typically framed as a task in which only little labeled training data are available. As in the case of unigrams, we show that for MWEs a feature-based approach beats state-of-the-art generic methods.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Unigram lexicon with sentiment-view information from Wiegand et al. (2016)

Figure 1

Table 2. Illustration of entries from the unigram sentiment-view lexicon from Wiegand et al. (2016)

Figure 2

Table 3. (Verbal) MWEs in different lexical resources

Figure 3

Table 4. Illustration of entries from the MWE gold standard lexicon

Figure 4

Table 5. The MWE gold standard lexicon

Figure 5

Figure 1. Illustration of a Wiktionary-entry.

Figure 6

Table 6. The different corpora used

Figure 7

Figure 2. Illustration of graph-based approach (unigram opinion words are labeled seeds).

Figure 8

Table 7. The 10 most similar unigrams for two different MWEs (embeddings were induced on the corpus LIU); unigrams conveying a sentiment view other than that of the MWE are in bold type

Figure 9

Table 8. Label propagation on different corpora and vector representations

Figure 10

Table 9. Average proportion of the different parts of speech among the 10 most similar unigram opinion words (embeddings were induced on LIU)

Figure 11

Table 10. The different representation foci

Figure 12

Table 11. Summary of all features used

Figure 13

Figure 3. Polarity distribution among the different sentiment views.

Figure 14

Figure 4. Sentiment-view distribution on light-verb constructions.

Figure 15

Figure 5. View distribution and token length.

Figure 16

Table 12. Global constraints enforcing sentiment-view consistency as incorporated in MLN

Figure 17

Figure 6. Learning curve using MLN, the best supervised classifier.

Figure 18

Table 13. Comparison of features groups

Figure 19

Table 14. Comparison of representation foci

Figure 20

Table 15. Comparison of different classifiers

Figure 21

Table 16. Derivation of opinion holders from sentiment views

Figure 22

Table 17. Summary of different classifiers used for opinion holder extraction

Figure 23

Table 18. Comparison of different opinion holder extraction systems