Linear work generation of R-MAT graphs

Lorenz Hübschle-Schneider; Peter Sanders

doi:10.1017/nws.2020.21

Linear work generation of R-MAT graphs

Published online by Cambridge University Press: 29 May 2020

Lorenz Hübschle-Schneider and

Peter Sanders

Show author details

Lorenz Hübschle-Schneider: Affiliation:
Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany (e-mail: huebschle@kit.edu)
Peter Sanders*: Affiliation:
Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany (e-mail: huebschle@kit.edu)
*: *Corresponding author. Email: sanders@kit.edu

Article contents

Abstract
Footnotes
References

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

R-MAT (for Recursive MATrix) is a simple, widely used model for generating graphs with a power law degree distribution, a small diameter, and communitys structure. It is particularly attractive for generating very large graphs because edges can be generated independently by an arbitrary number of processors. However, current R-MAT generators need time logarithmic in the number of nodes for generating an edge— constant time for generating one bit at a time for node IDs of the connected nodes. We achieve constant time per edge by precomputing pieces of node IDs of logarithmic length. Using an alias table data structure, these pieces can then be sampled in constant time. This simple technique leads to practical improvements by an order of magnitude. This further pushes the limits of attainable graph size and makes generation overhead negligible in most situations.

Keywords

graph generator parallel processing large graphs bit parallelism sampling

Information

Type: Research Article
Information: Network Science , Volume 8 , Issue 4 , December 2020 , pp. 543 - 550

DOI: https://doi.org/10.1017/nws.2020.21 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2020. Published by Cambridge University Press

Footnotes

Action Editor: Ulrik Brandes

References

Barabasi, A.-L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286(5439), 509–512.CrossRef Google Scholar PubMed

Bläsius, T., Friedrich, T., Katzmann, M., Meyer, U., Penschuck, M., & Weyand, C. (2019). Efficiently generating geometric inhomogeneous and hyperbolic random graphs. In 27th European symposium on algorithms (ESA) (pp. 21:1–21:14).Google Scholar

Chakrabarti, D., Zhan, Y., & Faloutsos, C. (2004). R-MAT: A recursive model for graph mining. In SIAM conference on data mining (SDM) (pp. 442–446). SIAM.CrossRef Google Scholar

Drobyshevskiy, M., & Turdakov, D. (2020). Random graph modeling. ACM Computing Surveys, 52(6), 131:1–131:36.Google Scholar

Funke, D., Lamm, S., Meyer, U., Penschuck, M., Sanders, P., Schulz, C., …, von Looz, M. (2019). Communication-free massively distributed graph generation. Journal of Parallel and Distributed Computing, 131, 200–217.CrossRef Google Scholar

Goldenberg, A., Zheng, A. X, Fienberg, S. E., & Airoldi, E. M. (2010). A survey of statistical network models. Foundations and Trends in Machine Learning, 2(2), 129–233.CrossRef Google Scholar

Hübschle-Schneider, L., & Sanders, P. (2019a). Linear work generation of R-MAT graphs. arXiv/CoRR,arXiv:1905.03525.CrossRef Google Scholar

Hübschle-Schneider, L., & Sanders, P. (2019b). Parallel weighted random sampling. In 27th European symposium on algorithms (ESA) (pp. 59:1–59:24). LIPIcs.Google Scholar

Krioukov, D., Papadopoulos, F., Kitsak, M., Vahdat, A., & Boguná, M. (2010). Hyperbolic geometry of complex networks. Physical Review E, 82(3), 036106-1-036106-18.CrossRef Google Scholar PubMed

Lancichinetti, A., Fortunato, S., & Radicchi, F. (2008). Benchmark graphs for testing community detection algorithms. Physical Review E, 78(4), 046110.CrossRef Google Scholar PubMed

Leskovec, J., Chakrabarti, D., Kleinberg, J., Faloutsos, C., & Ghahramani, Z. (2010). Kronecker graphs: An approach to modeling networks. Journal of Machine Learning Research, 11, 985–1042.Google Scholar

Leskovec, J., & Faloutsos, C. (2007). Scalable modeling of real graphs using Kronecker multiplication. In 24th international conference on machine learning (ICML) (pp. 497–504). ACM.CrossRef Google Scholar

Mahdian, M., & Xu, Y. (2007). Stochastic kronecker graphs. In 5th international workshop on algorithms and models for the web-graph (WAW) (pp. 179–186). Springer.CrossRef Google Scholar

Moreno, S., Pfeiffer, J. J., & Neville, J. (2018). Scalable and exact sampling method for probabilistic generative graph models. Data Mining and Knowledge Discovery, 32(6), 1561–1596.CrossRef Google Scholar

Murphy, R. C., Wheeler, K. B., Barrett, B. W., & Ang, J. A. (2010). Introducing the Graph 500. Cray user’s group.Google Scholar

Penschuck, M., Brandes, U., Hamann, M., Lamm, S., Meyer, U., Safro, I., …, Schulz , C. (2020). Recent advances in scalable network generation. arXiv/CoRR,arXiv:2003.00736.Google Scholar

Sanders, P., & Schulz, C. (2016). Scalable generation of scale-free graphs. Information Processing Letters, 116(7), 489–491.CrossRef Google Scholar

Staudt, C. L., Sazonovs, A., & Meyerhenke, H. (2016). NetworKit: A tool suite for large-scale complex network analysis. Network Science, 4(4), 508–530.CrossRef Google Scholar

Vose, M. D. (1991). A linear algorithm for generating random numbers with a given distribution. IEEE Transactions on Software Engineering (TSE), 17(9), 972–975.CrossRef Google Scholar

Walker, A. J. (1977). An efficient method for generating discrete random variables with general distributions. ACM Transactions on Mathematical Software (TOMS), 3(3), 253–256.CrossRef Google Scholar

Watts, D. J., & Strogatz, S. H. (1998). Collective dynamics of ‘small-world’ networks. Nature, 393(6684), 440.CrossRef Google Scholar PubMed

Article contents

Linear work generation of R-MAT graphs

Abstract

Keywords

Information

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests