Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-08T17:39:51.131Z Has data issue: false hasContentIssue false

Identification and summarisation of events from Twitter using clustering algorithms and deep neural network

Published online by Cambridge University Press:  13 September 2024

Kunal Chakma
Affiliation:
Computer Science and Engineering Department, National Institute of Technology, Agartala, Tripura, India
Anupam Jamatia*
Affiliation:
Computer Science and Engineering Department, National Institute of Technology, Agartala, Tripura, India
Dwijen Rudrapal
Affiliation:
Computer Science and Engineering Department, National Institute of Technology, Agartala, Tripura, India
*
Corresponding author: Anupam Jamatia; Email: anupamjamatia@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

The proliferation of social networks has caused an increase in the amount of textual content generated by users. The voluminous nature of such content poses a challenge to users, necessitating the development of technological solutions for automatic summarisation. This paper presents a two-stage framework for generating abstractive summaries from a collection of Twitter texts. In the first stage of the framework, event detection is carried out through clustering, followed by event summarisation in the second stage. Our approach involves generating contextualised vector representations of tweets and applying various clustering techniques to the vectors. The quality of the resulting clusters is evaluated, and the best clusters are selected for the summarisation task based on this evaluation. In contrast to previous studies, we experimented with various clustering techniques as a preprocessing step to obtain better event representations. For the summarisation task, we utilised pre-trained models of three state-of-the-art deep neural network architectures and evaluated their performance on abstractive summarisation of the event clusters. Summaries are generated from clusters that contain (a) unranked tweets, (b) all ranked tweets, and (c) the top 10 ranked tweets. Of these three sets of clusters, we obtained the best ROUGE scores from the top 10 ranked tweets. From the summaries generated from the clusters containing the top ten tweets, we obtained ROUGE-1 F score of 48%, ROUGE-2 F score of 37%, ROUGE-L F score of 44%, and ROUGE-SU F score of 33% which suggests that if relevant tweets are at the top of a cluster, and then better summaries are generated.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Framework of the proposed system.

Figure 1

Figure 2. 5W1H BIO sequence labelling example.

Figure 2

Figure 3. Contextualised vector generation.

Figure 3

Figure 4. Cluster selection process for the summarisation task.

Figure 4

Figure 5. Selection of best value for number of clusters $k$ for k-means and HAC.

Figure 5

Figure 6. Visualisation of generated clusters.

Figure 6

Table 1. A snapshot of the cluster of tweets labelled by k-means, HAC, HDBSCAN, and AP

Figure 7

Figure 7. Population distribution of clusters by k-means, HAC, AP, and HDBSCAN. (a) Ten clusters generated by k-means clustering with the distribution of 1,466, 2,280, 1,073, 1,481, 2,073, 834, 1,615, 1,768, 2,117, and 1,668 tweets. (b) Ten clusters generated by HAC clustering with the distribution of 2,267, 2,170, 3,004, 594, 3,928, 1,868, 100, 866, 385, and 293 tweets. (c) A total 934 clusters were generated by AP with minimum and maximum cluster size of 2 and 115 tweets, respectively. (d) HDBSCAN failed to cluster 10,057 tweets which is 61.4% of the dataset. The remaining 38.6% are clustered into 2,356 clusters with length between 2 and 23 tweets.

Figure 8

Table 2. Average ROUGE F-1 score for candidate summary similarity of Lexrank with other methods

Figure 9

Figure 8. Gold reference summary creation process for the summarisation task.

Figure 10

Figure 9. n-gram distribution in a particular cluster.

Figure 11

Figure 10. Similarity of tweets under a particular cluster by AP.

Figure 12

Figure 11. ROUGE Scores—Precision, Recall, and F-score for ROUGE-1, ROUGE-2, ROUGE-L, and ROUGE-SU.