Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-08T06:54:44.344Z Has data issue: false hasContentIssue false

A multi-purposed unsupervised framework for comparing embeddings of undirected and directed graphs

Published online by Cambridge University Press:  28 September 2022

Bogumił Kamiński
Affiliation:
Decision Analysis and Support Unit, SGH Warsaw School of Economics, Warsaw, Poland
Łukasz Kraiński
Affiliation:
Decision Analysis and Support Unit, SGH Warsaw School of Economics, Warsaw, Poland
Paweł Prałat*
Affiliation:
Department of Mathematics, Toronto Metropolitan University, Toronto, ON, Canada
François Théberge
Affiliation:
Tutte Institute for Mathematics and Computing, Ottawa, ON, Canada
*
*Corresponding author. Email: pralat@ryerson.ca
Rights & Permissions [Opens in a new window]

Abstract

Graph embedding is a transformation of nodes of a network into a set of vectors. A good embedding should capture the underlying graph topology and structure, node-to-node relationship, and other relevant information about the graph, its subgraphs, and nodes themselves. If these objectives are achieved, an embedding is a meaningful, understandable, and often compressed representation of a network. Unfortunately, selecting the best embedding is a challenging task and very often requires domain experts. In this paper, we extend the framework for evaluating graph embeddings that was recently introduced in [15]. Now, the framework assigns two scores, local and global, to each embedding that measure the quality of an evaluated embedding for tasks that require good representation of local and, respectively, global properties of the network. The best embedding, if needed, can be selected in an unsupervised way, or the framework can identify a few embeddings that are worth further investigation. The framework is flexible and scalable and can deal with undirected/directed and weighted/unweighted graphs.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Figure 1. Normalized global and local scores for FOOTBALL graph.

Figure 1

Figure 2. Two-dimensional projections of embeddings of FOOTBALL graph according to the framework: the best (left), the worst with respect to the local score (middle), and the worst with respect to the global score (right).

Figure 2

Figure 3. Approximated vs. exact global scores for LFR graphs and HOPE (left) and Node2Vec (right) embeddings.

Figure 3

Figure 4. Approximated vs. exact local scores for LFR graphs and HOPE (left) and Node2Vec (right) embeddings.

Figure 4

Table 1. Correlation between approximated and (exact) global scores

Figure 5

Table 2. Correlation between approximated and (exact) local scores

Figure 6

Figure 5. Pearson’s correlation between approximated and exact global scores for SMB, LFR, noisy-LFR graphs, and HOPE (left) and Node2Vec (right) embeddings.

Figure 7

Figure 6. Pearson’s correlation between approximated and exact local scores for SMB, LFR, noisy-LFR graphs, and HOPE (left) and Node2Vec (right) embeddings.

Figure 8

Figure 7. Global and local scores with accuracy (left) and AMI (right) overlay for SBM, LFR, noisy-LFR, EMAIL graphs, and HOPE and Node2Vec embeddings.

Figure 9

Table 3. Correlation coefficients between the accuracy scores in node classification task and global/local scores (averaged over all parameters)

Figure 10

Table 4. Correlation coefficients between the AMI scores in community detection task and global/local scores (averaged over all parameters)

Figure 11

Figure 8. Global and local scores with AUC overlay for SBM, LFR, noisy-LFR, EMAIL graphs, and HOPE and Node2Vec embeddings.

Figure 12

Table 5. Correlation coefficients between the AUC scores in link prediction task and global/local scores (averaged over all parameters)

Figure 13

Figure 9. Comparison of local/global scores (and associated quality measures for various tasks) for rewired graphs (left) and rescaled embeddings (right).

Supplementary material: File

Kamiński et al. supplementary material

Kamiński et al. supplementary material 1
Download Kamiński et al. supplementary material(File)
File 2.9 KB
Supplementary material: PDF

Kamiński et al. supplementary material

Kamiński et al. supplementary material 2
Download Kamiński et al. supplementary material(PDF)
PDF 6 MB