Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-07T13:48:21.097Z Has data issue: false hasContentIssue false

Graph representation learning: a survey

Published online by Cambridge University Press:  28 May 2020

Fenxiao Chen*
Affiliation:
University of Southern California, Los Angeles, CA90089, USA
Yun-Cheng Wang
Affiliation:
University of Southern California, Los Angeles, CA90089, USA
Bin Wang
Affiliation:
University of Southern California, Los Angeles, CA90089, USA
C.-C. Jay Kuo
Affiliation:
University of Southern California, Los Angeles, CA90089, USA
*
Corresponding author: Fenxiao Chen Email: fenxiaoc@usc.edu

Abstract

Research on graph representation learning has received great attention in recent years since most data in real-world applications come in the form of graphs. High-dimensional graph data are often in irregular forms. They are more difficult to analyze than image/video/audio data defined on regular lattices. Various graph embedding techniques have been developed to convert the raw graph data into a low-dimensional vector representation while preserving the intrinsic graph properties. In this review, we first explain the graph embedding task and its challenges. Next, we review a wide range of graph embedding techniques with insights. Then, we evaluate several stat-of-the-art methods against small and large data sets and compare their performance. Finally, potential applications and future directions are presented.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2020 published by Cambridge University Press in association with Asia Pacific Signal and Information Processing Association
Figure 0

Fig. 1. Illustration of graph representation learning input and output.

Figure 1

Fig. 2. Illustration of a LGCL method [29].

Figure 2

Fig. 3. Illustration of the sub-graph selection process [29].

Figure 3

Fig. 4. The architecture of HGNN [73].

Figure 4

Table 1. Comparison of properties of graphs and hyper-graphs

Figure 5

Fig. 5. Illustration of graph and hypergraph structures [73].

Figure 6

Table 2. Summary of different graph embedding methods

Figure 7

Table 3. Summary of representative graph data sets

Figure 8

Table 4. Performance comparison of nine common graph embedding methods in vertex classification on Cora, Citeseer, and Wiki

Figure 9

Fig. 6. t-SNE visualization of different embedding methods on Cora. The seven different colors of points represent different classes of the nodes.

Figure 10

Table 5. Comparison of time used in training (s)

Figure 11

Table 6. Comparison of clustering quality of six graph embedding methods in terms of macro- and micro-F1 scores against three large graph data sets

Figure 12

Fig. 7. The node classification accuracy as a function of the embedding size.

Figure 13

Fig. 8. The node classification accuracy as a function of the embedding size.