Sublinear Random Access Generators for Preferential Attachment Graphs

📅 2016-02-19

🏛️ International Colloquium on Automata, Languages and Programming

📈 Citations: 14

✨ Influential: 3

career value

246K/year

🤖 AI Summary

Efficiently supporting edge and adjacency queries on large-scale Barabási–Albert (BA) graphs (with out-degree one) and random recursive trees without storing or pre-generating the entire graph. Method: We propose the first on-demand, sublinear random-access framework, built upon probabilistic preprocessing, hierarchical indexing, and inverse sampling. It constructs a deterministic auxiliary structure and employs a lightweight online sampling algorithm. Results: Our framework guarantees polylogarithmic time, space, and random-bit complexity per query—i.e., $ ext{polylog}(n) $—while ensuring that query outputs are *exactly* distributed according to the standard BA model or random recursive tree. With probability $ 1 - 1/ ext{poly}(n) $, it enables faithful simulation of sublinear-time graph algorithms and accurate estimation of graph properties, significantly reducing computational, storage, and randomness overheads compared to full-graph approaches.

📝 Abstract

We consider the problem of sampling from a distribution on graphs, specifically when the distribution is defined by an evolving graph model, and consider the time, space, and randomness complexities of such samplers. In the standard approach, the whole graph is chosen randomly according to the randomized evolving process, stored in full, and then queries on the sampled graph are answered by simply accessing the stored graph. This may require prohibitive amounts of time, space, and random bits, especially when only a small number of queries are actually issued. Instead, we propose a setting where one generates parts of the sampled graph on-the-fly, in response to queries, and therefore requires amounts of time, space, and random bits that are a function of the actual number of queries. Yet, the responses to the queries correspond to a graph sampled from the distribution in question. Within this framework, we focus on two random graph models: the Barabási-Albert Preferential Attachment model (BA-graphs) (Science, 286 (5439):509–512) (for the special case of out-degree 1) and the random recursive tree model (Theory of Probability and Mathematical Statistics, (51):1–28). We give on-the-fly generation algorithms for both models. With probability 1-1/poly(n), each and every query is answered in polylog(n) time, and the increase in space and the number of random bits consumed by any single query are both polylog(n), where n denotes the number of vertices in the graph. Our work thus proposes a new approach for the access to huge graphs sampled from a given distribution, and our results show that, although the BA random graph model is defined by a sequential process, efficient random access to the graph’s nodes is possible. In addition to the conceptual contribution, efficient on-the-fly generation of random graphs can serve as a tool for the efficient simulation of sublinear algorithms over large BA-graphs, and the efficient estimation of their on such graphs.

Problem

Research questions and friction points this paper is trying to address.

Efficiently sampling large preferential attachment graphs on-the-fly

Reducing time, space, and randomness costs for graph queries

Enabling random access to nodes in sequentially-defined graph models

Innovation

Methods, ideas, or system contributions that make the work stand out.

On-the-fly graph generation for queries

Polylog time per query with high probability

Reduced space and randomness requirements

🔎 Similar Papers

Random Walk Diffusion for Efficient Large-Scale Graph Generation