Balanced Partitioning for Optimizing Big Graph Computation: Complexities and Approximation Algorithms

Baoling Ning, Jianzhong Li

TL;DR

The paper addresses graph partitioning tailored to big-graph computation by introducing workload-driven (W$k$BGP) and motif-driven (M$k$BGP) objectives. It develops semidefinite programming representations to capture partitioning structure and applies sophisticated rounding to obtain bi-criteria $O(\sqrt{\log n \log k})$-approximation algorithms. It proves NP-hardness and inapproximability for motif-based partitioning (even with triangles) while delivering a tractable SDP-based approach for the triangle case and extending it to general motifs. Together, these results yield principled partitioning methods with guarantees that improve workload performance and motif computation on large-scale graphs.

Abstract

Graph partitioning is a fundamental problem in big graph computation. Previous work does not consider the practical requirements that arise when optimizing big data analysis in real applications. Motivated by optimizing big data computing applications, this paper studies two typical graph partitioning problems. The first is to optimize the performance of specific workloads through graph partitioning, for which algorithms with performance guarantees are lacking. The second is to optimize the computation of motifs through graph partitioning, which previous work has not addressed. First, formal definitions of the two problems are introduced, and semidefinite programming representations are designed based on an analysis of their properties. The motif-based partitioning problem is proved to be NP-complete even for the special case where $k=2$ and the motif is a triangle, and its inapproximability is shown by proving that no efficient algorithm achieves a finite approximation ratio. Finally, using semidefinite programming and sophisticated rounding techniques, polynomial-time bi-criteria $O(\sqrt{\log n\log k})$-approximation algorithms are designed and analyzed for both problems.

Paper Structure

This paper contains 20 sections, 17 theorems, 15 equations, 4 figures, 1 table, 2 algorithms.

Key Result

theorem 1

The $k$BGP problem is $\textsf{NP}$-complete, even for the special case $k=2$. ∎
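
To make the $k=2$ case concrete, here is a minimal sketch of exact balanced bisection by exhaustive search. It is an illustration rather than anything from the paper. It assumes an even number of vertices labelled `0..n-1`, takes "balanced" to mean two parts of exactly `n/2` vertices each, and uses the number of cut edges as the cost. The function names are hypothetical. The number of candidate splits grows exponentially with `n`, and since the problem is NP-complete no polynomial-time exact algorithm is known, which is why approximation algorithms are of interest.

```python
from itertools import combinations


def cut_size(edges, part_a):
    """Count edges with exactly one endpoint in part_a."""
    return sum((u in part_a) != (v in part_a) for u, v in edges)


def min_balanced_bisection(n, edges):
    """Exhaustively search all splits of vertices 0..n-1 into two equal halves.

    Assumption for this sketch: n is even and both parts must contain
    exactly n/2 vertices.
    """
    if n % 2 != 0:
        raise ValueError("this sketch assumes an even number of vertices")
    best_cut, best_part = None, None
    # Pin vertex 0 to part A so each bisection is enumerated only once.
    for rest in combinations(range(1, n), n // 2 - 1):
        part_a = frozenset((0,) + rest)
        c = cut_size(edges, part_a)
        if best_cut is None or c < best_cut:
            best_cut, best_part = c, part_a
    return best_cut, best_part


# Two triangles joined by the single edge (2, 3).
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
best_cut, best_part = min_balanced_bisection(6, edges)
print(best_cut, sorted(best_part))  # prints: 1 [0, 1, 2]
```

On this toy graph the search separates the two triangles and cuts only the bridging edge. The loop visits $\binom{n-1}{n/2-1}$ candidate splits, so the sketch is only usable on very small graphs.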

Figures (4)

  • Figure 1: An Example of Balanced Graph Partitioning Driven by Workload
  • Figure 2: An Example of Balanced Graph Partitioning Driven by Motif Computation
  • Figure 3: An Example of the $\textsc{P}$-time reduction used in the proof of the theorem on the NP-completeness of M$k$BGP for triangle motifs
  • Figure 4: An Illustration of $F(x)$ used in the proof of the theorem on the NP-completeness of M$k$BGP for triangle motifs

Theorems & Definitions (32)

  • definition 1: the partition and $k$-partition solution of a graph
  • definition 2: $k$-balanced partition solution of a graph
  • definition 3: $k$-balanced graph partitioning, $k$BGP for short
  • theorem 1: Garey79ComputersIntractablity
  • theorem 2: Andreev06BalancedGraph
  • definition 4: the $(k,\tau)$-balanced partition solution of a graph
  • definition 5: The W$k$BGP Problem
  • proposition 1
  • proposition 2
  • definition 6: The M$k$BGP problem
  • ...and 22 more