OPTiC: Opportunistic Graph Processing in Multi-Tenant Clusters

  • Muntasir Raihan Rahman ,
  • Indranil Gupta ,
  • Akash Kapoor ,
  • Haozhen Ding

2018 IEEE International Conference on Cloud Engineering (IC2E) |

Published by IEEE | Organized by IEEE

We present OPTiC, a multi-tenant scheduler intended for distributed graph processing frameworks. OPTiC proposes opportunistic scheduling, whereby queued jobs can be pre-scheduled at cluster nodes when the cluster is fully busy running jobs. This allows overlapping of data ingress with ongoing computation. To pre-schedule wisely, OPTiC’s novel contribution is a profile-free and cluster-agnostic approach to compare progress of graph processing jobs. OPTiC is implemented inside Apache Giraph, with YARN underneath. Our experiments with real workload traces and network models show that OPTiC’s opportunistic scheduling improves run time (both at the median and at the tail) by 20%-82% compared to baseline multi-tenancy, in a variety of scenarios.