Table of Contents
Fetching ...

An Efficient Local Search Approach for Polarized Community Discovery in Signed Networks

Linus Aronsson, Morteza Haghir Chehreghani

TL;DR

This work tackles polarized community discovery in signed networks by allowing a neutral vertex set $S_0$ and proposing a CC-based objective $f({\bm{x}})=(N^+_{\text{intra}}-N^-_{\text{intra}}) + \alpha(N^-_{\text{inter}}-N^+_{\text{inter}}) - \beta\sum_{m=1}^k |S_m|^2$, which promotes density while avoiding unbalanced partitions. It develops the first scalable local-search approach for PCD by embedding the problem in a block-coordinate Frank–Wolfe (FW) framework with a relaxed simplex representation, proving an equivalence to discrete local search when starting from a discrete solution and achieving a linear $O(1/t)$ convergence rate. The method, LSPCD, employs efficient gradient computations and a precomputed matrix to reduce per-iteration cost to $O(n)$ (after an initial $O(kn^2)$ setup), enabling large-scale experiments. Empirical results on real and synthetic data show that LSPCD delivers high polarity with balanced cluster sizes and competitive runtimes, providing a practical tool for analyzing polarization and trust dynamics in online and offline social systems.

Abstract

Signed networks, where edges are labeled as positive or negative to represent friendly or antagonistic interactions, provide a natural framework for analyzing polarization, trust, and conflict in social systems. Detecting meaningful group structures in such networks is crucial for understanding online discourse, political divisions, and trust dynamics. A key challenge is to identify communities that are internally cohesive and externally antagonistic, while allowing for neutral or unaligned vertices. In this paper, we propose a method for identifying $k$ polarized communities that addresses a major limitation of prior methods: their tendency to produce highly size-imbalanced solutions. We introduce a novel optimization objective that avoids such imbalance. In addition, it is well known that approximation algorithms based on local search are highly effective for clustering signed networks when neutral vertices are not allowed. We build on this idea and design the first local search algorithm that extends to the setting with neutral vertices while scaling to large networks. By connecting our approach to block-coordinate Frank-Wolfe optimization, we prove a linear convergence rate, enabled by the structure of our objective. Experiments on real-world and synthetic datasets demonstrate that our method consistently outperforms state-of-the-art baselines in solution quality, while remaining competitive in computational efficiency.

An Efficient Local Search Approach for Polarized Community Discovery in Signed Networks

TL;DR

This work tackles polarized community discovery in signed networks by allowing a neutral vertex set and proposing a CC-based objective , which promotes density while avoiding unbalanced partitions. It develops the first scalable local-search approach for PCD by embedding the problem in a block-coordinate Frank–Wolfe (FW) framework with a relaxed simplex representation, proving an equivalence to discrete local search when starting from a discrete solution and achieving a linear convergence rate. The method, LSPCD, employs efficient gradient computations and a precomputed matrix to reduce per-iteration cost to (after an initial setup), enabling large-scale experiments. Empirical results on real and synthetic data show that LSPCD delivers high polarity with balanced cluster sizes and competitive runtimes, providing a practical tool for analyzing polarization and trust dynamics in online and offline social systems.

Abstract

Signed networks, where edges are labeled as positive or negative to represent friendly or antagonistic interactions, provide a natural framework for analyzing polarization, trust, and conflict in social systems. Detecting meaningful group structures in such networks is crucial for understanding online discourse, political divisions, and trust dynamics. A key challenge is to identify communities that are internally cohesive and externally antagonistic, while allowing for neutral or unaligned vertices. In this paper, we propose a method for identifying polarized communities that addresses a major limitation of prior methods: their tendency to produce highly size-imbalanced solutions. We introduce a novel optimization objective that avoids such imbalance. In addition, it is well known that approximation algorithms based on local search are highly effective for clustering signed networks when neutral vertices are not allowed. We build on this idea and design the first local search algorithm that extends to the setting with neutral vertices while scaling to large networks. By connecting our approach to block-coordinate Frank-Wolfe optimization, we prove a linear convergence rate, enabled by the structure of our objective. Experiments on real-world and synthetic datasets demonstrate that our method consistently outperforms state-of-the-art baselines in solution quality, while remaining competitive in computational efficiency.

Paper Structure

This paper contains 27 sections, 17 theorems, 53 equations, 7 figures, 11 tables, 3 algorithms.

Key Result

Theorem 1

Problem problem-kpcd (i.e., $k$-PCD) is NP-hard.

Figures (7)

  • Figure 1: Comparison of runtime for the three implementations of Alg. \ref{['alg:ls']} (local search) introduced in Section \ref{['section:complexity']} by varying the graph size $n$ and the number of non-neutral clusters $k$, using data generated from the m-SSBM model. See Section \ref{['section:experiments']} for a description of this dataset. The noise level is fixed at $\eta = 0.4$. When varying $n$, we fix $k = 4$; when varying $k$, we fix $n = 5000$. LSPCD corresponds to Alg. \ref{['alg:ls2']} and is used in all subsequent experiments because of its superior computational efficiency.
  • Figure 2: F1-score and polarity of different methods on synthetic graphs generated using the m-SSBM model, as the noise level $\eta$ varies. See main text below for details. See Appendix \ref{['appendix:syntheticresults']} for more results.
  • Figure 3: F1-score and polarity of different methods on synthetic graphs generated using the m-SSBM model, as the size ratio parameter $\rho$ varies. A larger value of $\rho$ means the ground-truth non-neutral clusters are more imbalanced. The noise level is fixed to $\eta = 0.4$.
  • Figure 4: Runtime comparison on large-scale synthetic datasets generated using the m-SSBM model. LSPCD consistently achieves higher F1-scores than SCG-MA while requiring less runtime, demonstrating superior scalability and efficiency. We fixed $\eta = 0.45$, $k = 6$, and $\ell = 500$.
  • Figure 5: Impact of $\beta$ on the WikiPol dataset with $k = 6$.
  • ...and 2 more figures

Theorems & Definitions (28)

  • Theorem 1
  • Proposition 1
  • Theorem 2
  • Corollary 1
  • Definition 1: FW duality gap
  • Definition 2: Convergence rate
  • Theorem 3
  • Theorem 4
  • Proposition 2
  • Proposition 3
  • ...and 18 more