The increasing importance of graph data in various fields makes the efficient processing of large-scale graph data highly necessary where well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology that balances the size of each partition while reducing the number of edge-cuts. Moreover, the graph partitioning algorithm should achieve high performance and scalability.
In this paper, we present a novel graph-partitioning algorithm that ensures low edge-cuts and high-performance processing capability for parallel processing. Based on the label propagation algorithm, we propose the formulas to improve the degree of edge-cuts and to achieve fast convergence. By removing the necessity of processing the label propagation for all nodes, our approach processes the label propagation of candidate nodes based on a proposed score metric.
Our proposed algorithm introduces a stabilization phase in which remote and highly connected nodes are relocated to avoid the algorithm becoming trapped around local optima. Comparison results show that a prototype based on the proposed algorithm outperforms other well-known parallel graph-partitioning frameworks in terms of speed and balance.