NCSAC: Effective Neural Community Search via Attribute-augmented Conductance
Longlong Lin, Quanao Li, Miao Qiao, Zeli Wang, Jin Zhao, Rong-Hua Li, Xin Luo, Tao Jia
TL;DR
This work tackles neural community search on attributed graphs by bridging traditional rule-based constraints with deep learning. It introduces attribute-augmented conductance, blending topology and attribute similarity to obtain a high-quality coarse candidate, then refines this candidate via a PPO-based reinforcement learning framework that learns when and how to add or remove nodes. Key contributions include the formalization and efficient computation of $(C)$, an adaptive extractor with $O(m+nk)$ complexity, a community-aware state encoder with contrastive and triplet losses, and an RL-based refiner with flexible termination and a reward shaping mechanism. Empirical results on six real graphs show consistent F1-score improvements (up to $42.4\%$) over ten baselines, establishing NCSAC as a scalable and effective solution for neural community search.
Abstract
Identifying locally dense communities closely connected to the user-initiated query node is crucial for a wide range of applications. Existing approaches either solely depend on rule-based constraints or exclusively utilize deep learning technologies to identify target communities. Therefore, an important question is proposed: can deep learning be integrated with rule-based constraints to elevate the quality of community search? In this paper, we affirmatively address this question by introducing a novel approach called Neural Community Search via Attribute-augmented Conductance, abbreviated as NCSAC. Specifically, NCSAC first proposes a novel concept of attribute-augmented conductance, which harmoniously blends the (internal and external) structural proximity and the attribute similarity. Then, NCSAC extracts a coarse candidate community of satisfactory quality using the proposed attribute-augmented conductance. Subsequently, NCSAC frames the community search as a graph optimization task, refining the candidate community through sophisticated reinforcement learning techniques, thereby producing high-quality results. Extensive experiments on six real-world graphs and ten competitors demonstrate the superiority of our solutions in terms of accuracy, efficiency, and scalability. Notably, the proposed solution outperforms state-of-the-art methods, achieving an impressive F1-score improvement ranging from 5.3\% to 42.4\%. For reproducibility purposes, the source code is available at https://github.com/longlonglin/ncsac.
