R(3)MAT: A Rapid and Robust Graph Generator

Angles, Renzo; Paredes, Rodrigo; Garcia, Roberto

Abstract

One of the main problems when developing graph-based applications is the availability of large and representative datasets. The lack of real graphs has motivated the development of software tools for generating synthetic graphs. R-MAT is a data generation method that was designed to produce synthetic graphs whose characteristics resemble those occurring in real networks. Although the generation model defined by R-MAT is easy to understand, its implementation is not trivial and it has intrinsic memory restrictions that makes the generation of very large graphs difficult. This paper studies the practical implementation of R-MAT. We discuss the issues of the original implementation which works with the adjacency matrix of the graph and analyze the size of the resulting graph obtained with the R-MAT model. Then, we introduce and experimentally evaluate R(3)MAT, an alternative implementation for R-MAT based on an array of degrees. These experiments show that (i) our R(3)MAT is able to generate graphs of hundred million nodes and billion edges in a single machine, (ii) our method preserves the characteristic power-law distribution of the edge degrees present in real-world graphs, and (iii) R(3)MAT has the best performance in the current state of the art, when considering a single modest computer in a sequential fashion.

Más información

Título según WOS: R(3)MAT: A Rapid and Robust Graph Generator
Título de la Revista: IEEE ACCESS
Volumen: 8
Editorial: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Fecha de publicación: 2020
Página de inicio: 130048
Página final: 130065
DOI:

10.1109/ACCESS.2020.3009577

Notas: ISI