Friday, June 24 • 9:25am - 9:50am
Load the Edges You Need: A Generic I/O Optimization for Disk-based Graph Processing


Single-PC, disk-based processing of big graphs has recently gained much popularity. At the core of an efficient disk-based system is a well-designed partition structure that minimizes random disk accesses. All existing systems use static partitions that are created before processing starts. These partitions have fixed layouts and are loaded entirely into memory in every iteration, even though much of the edge data does not change across many iterations. Such unchanged edges contribute nothing new to the computation of vertex values, so loading them is wasted I/O.

This work provides a general optimization that removes this I/O inefficiency by employing dynamic partitions whose layouts are dynamically adjustable. Our implementation of this optimization in GraphChi, a representative out-of-core vertex-centric graph system, yielded speedups of 1.5–2.8× on six large graphs. Our idea is generally applicable to other systems as well.
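
As a rough illustration of why skipping unchanged edges saves I/O, the following is a minimal in-memory sketch in Python. It is not GraphChi's implementation and does not model on-disk partitions. The function name `min_label_propagation`, the `dynamic` flag, and the `edges_loaded` counter are all hypothetical, introduced only for this example. It runs a simple minimum-label propagation and counts how many edge records are read when every edge is read in every iteration versus only the edges whose source vertex changed in the previous iteration.

```python
from collections import defaultdict


def min_label_propagation(num_vertices, edges, dynamic=True):
    """Propagate the minimum label along directed edges until nothing changes.

    Returns (labels, edges_loaded), where edges_loaded counts how many edge
    records were read across all iterations. This is an illustrative stand-in
    for disk I/O, not a model of any real system's partition format.
    """
    labels = list(range(num_vertices))
    out_edges = defaultdict(list)
    for src, dst in edges:
        out_edges[src].append(dst)

    changed = set(range(num_vertices))  # every vertex counts as new at the start
    edges_loaded = 0
    while changed:
        # Static behaviour: read every edge in every iteration.
        # Dynamic behaviour: read only edges whose source value changed
        # in the previous iteration; the others carry no new information.
        sources = changed if dynamic else range(num_vertices)
        next_changed = set()
        new_labels = list(labels)
        for src in sources:
            for dst in out_edges.get(src, ()):
                edges_loaded += 1
                if labels[src] < new_labels[dst]:
                    new_labels[dst] = labels[src]
                    next_changed.add(dst)
        labels = new_labels
        changed = next_changed
    return labels, edges_loaded


if __name__ == "__main__":
    chain = [(0, 1), (1, 2), (2, 3), (3, 4)]
    print(min_label_propagation(5, chain, dynamic=False))
    print(min_label_propagation(5, chain, dynamic=True))
```

Both modes converge to the same labels; the dynamic mode simply reads fewer edges along the way. A disk-based system additionally has to reorganize partitions so that the needed edges can be read sequentially, which this sketch leaves out.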

Friday June 24, 2016 9:25am - 9:50am MDT
Denver Marriott City Center 1701 California Street, Denver, CO 80202
