Hostname: page-component-77f85d65b8-hzqq2 Total loading time: 0 Render date: 2026-03-28T16:58:53.390Z Has data issue: false hasContentIssue false

Negative ties highlight hidden extremes in social media polarization

Published online by Cambridge University Press:  29 August 2025

E. Candellone*
Affiliation:
Department of Methodology and Statistics, Utrecht University, Utrecht, Netherlands Centre for Complex Systems Studies, Utrecht University, Utrecht, Netherlands
S. A. Babul
Affiliation:
Mathematical Institute, University of Oxford, Oxford, UK
Ö. Togay
Affiliation:
Department of Methodology and Statistics, Utrecht University, Utrecht, Netherlands
A. Bovet
Affiliation:
Department of Mathematical Modeling and Machine Learning, University of Zurich, Zurich, Switzerland Digital Society Initiative, University of Zurich, Zurich, Switzerland
J. Garcia-Bernardo
Affiliation:
Department of Methodology and Statistics, Utrecht University, Utrecht, Netherlands Centre for Complex Systems Studies, Utrecht University, Utrecht, Netherlands ODISSEI SoDa Team, Utrecht University, Utrecht, Netherlands
*
Corresponding author: Elena Candellone; Email: candellone.elena@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Human interactions in the online world comprise a combination of positive and negative exchanges. These diverse interactions can be captured using signed network representations, where edges take positive or negative weights to indicate the sentiment of the interaction between individuals. Signed networks offer valuable insights into online political polarization by capturing antagonistic interactions and ideological divides on social media platforms. This study analyzes polarization on Menéame, a Spanish social media platform that facilitates engagement with news stories through comments and voting. Using a dual-method approach—Signed Hamiltonian Eigenvector Embedding for Proximity for signed networks and Correspondence Analysis for unsigned networks—we investigate how including negative ties enhances the understanding of structural polarization levels across different conversation topics on the platform. While the unsigned Menéame network effectively delineates ideological communities, only by incorporating negative ties can we identify ideologically extreme users who engage in antagonistic behaviors: without them, the most extreme users remain indistinguishable from their less confrontational ideological peers.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Menéame platform. Schematic representation of one of the stores in the platform. Users can upvote and downvote stories, and upvote and downvote comments within the story. Downvoting stories is possible only for registered users through the “Report” button, while upvoting stories is allowed to everyone. Only registered users can vote for comments. Comments with many positive votes appear on the platform highlighted in orange.

Figure 1

Table 1. Number of stories, votes, upvotes, and downvotes per macro-topic in the dataset

Figure 2

Figure 2. Comparing news outlets’ attitudes toward the Russia-Ukraine war and politics. The main panels (A–B) display the embeddings for each news outlet, obtained by both positive and negative ties (SHEEP) and only positive ties (CA). The smaller panels (x-I) and (x-II) compare the two embedding techniques with the ideology retrieved from Twitter for a subset of news outlets. Colors represent the Twitter ideology in all panels, ranging from left-wing (brown) to right-wing (dark purple), while news outlets not classified are colored in gray. In the case of Russia (panel A), both CA and SHEEP identify the army-related news outlet Revista Ejércitos as an outlier, whereas only SHEEP distinguishes the pro-Russia news outlets Russia Today, Diario Octubre, and Actualidad RT from the left-leaning media such as ctxt or publico. Panel B shows that the two methods are highly correlated, both identifying ideology in accordance with Twitter (see panels B-I and B-II) when considering the politics topic. In contrast, panels A-I and A-II show that both SHEEP and CA are less correlated with Twitter ideology in the case of the Russia-Ukraine topic.

Figure 3

Figure 3. Visualization of user-to-user network for the Russia-Ukraine war topic. Both panels share the same layout, generated using the Fruchterman-Reingold force-directed algorithm, but not all the nodes appear on both networks. A random sample of 3,000 nodes is shown. Edges represent interactions: positive in blue, negative in red (only in SHEEP), and those with absolute weight smaller than 3 are filtered out. Node colors show standardized SHEEP (A) and CA (B) embedding values, capped at 2, with red indicating anti-NATO and blue pro-NATO attitudes. The layout reveals two ideological factions with notable cross-faction interaction. Note that for SHEEP (A), the most extreme users (darker shades of blue and red) are located within the network, while for CA (B), they tend to be the ones with a few votes and thus no visible edges. Panel C compares SHEEP and CA in determining user attitudes: each point is a user, with the color indicating their tendency to vote positively (dark blue) or negatively (red). Blue and red circles highlight users voting positively on revista ejércitos and Russia today, respectively. Histograms show user distribution across the embedding space for each method. Note that SHEEP identifies extreme negative voters in both factions, and the methods are strongly correlated, with a Spearman correlation of 88%.

Figure 4

Figure 4. Visualization of user-to-user network for the broad politics. Both panels share the same layout, generated using the Fruchterman-Reingold force-directed algorithm. A random sample of 3,000 nodes is shown. Edges represent interactions: positive in blue, negative in red (only in SHEEP), and those with absolute weight smaller than 5 are filtered out. Node colors show standardized SHEEP (A) and CA (B) embedding values, capped at 2. The layout reveals two ideological factions with notable cross-faction interaction. Note that for SHEEP (A), the most extreme users (darker shades of blue and red) are located within the network, while for CA (B), the most extreme users are those with few votes (and thus have no edges visible). Panel C compares SHEEP and CA in determining user attitudes towards politics. Each point represents a user, with color indicating their tendency to vote positively (dark blue) or negatively (red). Red circles are users who vote positively for far-left media. The histograms show the distribution of users across the embedding space for each method. We note that SHEEP places extremely negative voters in the left faction, while in general, the two methods are consistent in their identification of most users, with a Spearman correlation of 80%.

Figure 5

Figure 5. Comparison of the structural positioning derived from two embedding methods: (A) SHEEP and (B) Correspondence Analysis (CA), with ideological positioning. The x-axis represents the binned structural positioning obtained from the embeddings, while the y-axis indicates the average vote of users in each bin toward stories (using the bipartite network described in the previous section). Votes are weighted by the average number of votes per domain and the sign of those votes (see Methods). Points represent ideological bins, with a dashed gray line showing a smoothed regression (lowess) to highlight trends. Positive y-values indicate a higher propensity for and positivity in voting for right-wing news outlets, or a lower propensity for and positivity in voting for left-wing news outlets. Note that CA captures political ideology more linearly compared to SHEEP, which exhibits a non-linear pattern.

Figure 6

Table A1. Cross-tabulation of binned average vote scores (Topic: Russia-Ukraine war)

Figure 7

Table A2. Cross-tabulation of binned average vote scores (Topic: Broad Politics)

Figure 8

Figure A1. Variation of the number of outliers, the number of topics, and the coherence, respectively varying the parameters $\epsilon$ and min sample. As we would like a situation with not many outliers and a "reasonable" number of topics, we choose the following values for the parameters: $\text{min_sample}=1$ and $\text{cluster_selection_epsilon}=3 \times 10^{-6}$.

Figure 9

Table A3. Topics obtained with BERTopic before any outlier reduction

Figure 10

Table A4. Topics obtained with hSBM

Figure 11

Figure A2. Confusion matrix with colors indicating the overlap between topics identified by BERTopic and hSBM.

Figure 12

Figure A3. Visualization of the first two components of the Correspondence Analysis for both news outlets (dark purple) and political parties (yellow). We interpret the first dimension as left-right ideology, and the second dimension (not used in this paper) as mainstream-radical.

Figure 13

Figure A4. Comparison of the ideology of media outlets and Twitter accounts. The x-axis represents the ideology of media outlets as reported by Political Watch, while the y-axis represents the ideology of Twitter accounts. The dashed line represents the regression line.

Figure 14

Figure A5. Analyzing users’ views on Russia-Ukraine war. Panels A and B show the distribution of users across the embedding space for SHEEP and CA, respectively. Bar colors reflect k-means clustering (see Appendix A.6). Note that histogram scales differ from Figure 3 as the bins have uniform size. Panels C and D display heatmaps of normalized vote probabilities, with rows representing voters’ attitudes and columns representing the attitudes of users they vote on. Note that voting tends to occur between users with similar attitudes (votes often lie close to the diagonal). Panels E and F show average votes cast on stories and comments, ranging from −1 (all negative) to +1 (all positive). Users generally vote positively, except for extreme users, who downvote the opposite extreme. Clusters from k-means (matching panel A/B colors) are shown as boxes: anti-NATO (blue, dark purple), moderate (gray, pink), and pro-NATO (yellow). See Appendix A.7 for details on binning, vote normalization, and cluster interpretation.

Figure 15

Figure A6. Analyzing users’ views on broad politics. Panel A and B show again the distribution of users across the embedding space, for SHEEP and CA, respectively. Bar colors reflect k-means clustering(see Appendix A.6). Note that histogram scales differ from Figure 4 as the bins have uniform size. Panels C and D display heatmaps of normalized vote probabilities, with rows representing voters’ attitudes and columns representing the attitudes of users they vote on the users they vote on. Panels E and F show average votes cast on stories and comments, ranging from −1 (all negative) to +1 (all positive). Users generally vote positively, except for extreme left-wing users, who vote negatively towards the opposite extreme. Clusters from k-means (matching panel A/B colors) are shown as boxes: far-left (dark blue, dark purple), left-wing (light blue, pink), right-wing (gray, orange), and far-right (yellow). See Appendix A.7 for details on binning, vote normalization, and cluster interpretation.

Figure 16

Figure A7. User-to-user network embeddings on Russia topic. Comparing user embeddings generated by CA, SHEEP, and SHEEP null model. As in Figure 3, the color indicates the tendency of the user to vote positively (dark blue) or negatively (red). The Pearson correlation for SHEEP vs CA is 0.68, SHEEP null model vs CA is 0.83, and SHEEP null model vs SHEEP is 0.61.

Figure 17

Figure A8. User-to-user network embeddings on politics topic. Comparing user embeddings generated by CA, SHEEP, and SHEEP null model. As in Figure 4, the color indicates the tendency of the user to vote positively (dark blue) or negatively (red). The Pearson correlation for SHEEP vs CA is 0.56, SHEEP null model vs CA is 0.94, and SHEEP null model vs SHEEP is 0.54.