A Sketch-Based Distance Oracle for Web-Scale Graphs

  • Sreenivas Gollapudi ,
  • Marc Najork ,
  • Rina Panigrahy ,
  • Atish Das Sarma

3rd ACM International Conference on Web Search and Data Mining (WSDM) |

Published by Association for Computing Machinery, Inc.

We study the fundamental problem of computing distances between nodes in large graphs such as the web graph and social networks. Our objective is to be able to answer distance queries between pairs of nodes in real time. Since the standard shortest path algorithms are expensive, our approach moves the time-consuming shortest-path computation offline, and at query time only looks up precomputed values and performs simple and fast computations on these precomputed values. More specifically, during the offline phase we compute and store a small “sketch” for each node in the graph, and at query-time we look up the sketches of the source and destination nodes and perform a simple computation using these two sketches to estimate the distance.