DIST: Efficient k-Clique Listing via Induced Subgraph Trie

Yehyun Nam; Jihoon Jang; Kunsoo Park; Jianye Yang; Cheng Long

DIST: Efficient k-Clique Listing via Induced Subgraph Trie

Yehyun Nam, Jihoon Jang, Kunsoo Park, Jianye Yang, Cheng Long

TL;DR

This work tackles the challenging problem of listing all $k$-cliques in large graphs. It introduces the Induced Subgraph Trie to memoize and efficiently retrieve cliques, coupled with a pruning mechanism based on soft embeddings of $l$-trees and a density-aware ListingDense routine for dense subgraphs. Empirical results on 16 real networks show DIST substantially outperforms state-of-the-art methods in both running time and memory usage, including notable gains on graphs with large maximum clique sizes and in parallel execution. The approach enables scalable, exact enumeration of $k$-cliques and offers a practical memory management strategy, suggesting broad applicability to cohesive subgraph mining tasks.

Abstract

Listing k-cliques plays a fundamental role in various data mining tasks, such as community detection and mining of cohesive substructures. Existing algorithms for the k-clique listing problem are built upon a general framework, which finds k-cliques by recursively finding (k-1)-cliques within subgraphs induced by the out-neighbors of each vertex. However, this framework has inherent inefficiency of finding smaller cliques within certain subgraphs repeatedly. In this paper, we propose an algorithm DIST for the k-clique listing problem. In contrast to existing works, the main idea in our approach is to compute each clique in the given graph only once and store it into a data structure called Induced Subgraph Trie, which allows us to retrieve the cliques efficiently. Furthermore, we propose a method to prune search space based on a novel concept called soft embedding of an l-tree, which further improves the running time. We show the superiority of our approach in terms of time and space usage through comprehensive experiments conducted on real-world networks; DIST outperforms the state-of-the-art algorithm by up to two orders of magnitude in both single-threaded and parallel experiments.

DIST: Efficient k-Clique Listing via Induced Subgraph Trie

TL;DR

Abstract

DIST: Efficient k-Clique Listing via Induced Subgraph Trie

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (19)