Finding Densest Subgraphs with Edge-Color Constraints

Lutz Oettershagen; Honglian Wang; Aristides Gionis

Finding Densest Subgraphs with Edge-Color Constraints

Lutz Oettershagen, Honglian Wang, Aristides Gionis

TL;DR

This work studies densest subgraphs under edge-color constraints, formalizing edge-colored DSP with per-color quotas h and variants for at least, at most, and exactly colored edges. It establishes NP-completeness for the decision versions (even with two colors) and provides linear-time constant-factor approximations for the at least h colored-edges variant in everywhere sparse graphs, along with a related non-colored variant. The methods extend to graphs with multiple edge colors via a multi-graph transformation, preserving constant-factor guarantees. Experiments on real networks demonstrate strong practical performance and scalability, including a diverse coauthorship use case that highlights the benefits of edge diversity in dense communities.

Abstract

We consider a variant of the densest subgraph problem in networks with single or multiple edge attributes. For example, in a social network, the edge attributes may describe the type of relationship between users, such as friends, family, or acquaintances, or different types of communication. For conceptual simplicity, we view the attributes as edge colors. The new problem we address is to find a diverse densest subgraph that fulfills given requirements on the numbers of edges of specific colors. When searching for a dense social network community, our problem will enforce the requirement that the community is diverse according to criteria specified by the edge attributes. We show that the decision versions for finding exactly, at most, and at least $\textbf{h}$ colored edges densest subgraph, where $\textbf{h}$ is a vector of color requirements, are NP-complete, for already two colors. For the problem of finding a densest subgraph with at least $\textbf{h}$ colored edges, we provide a linear-time constant-factor approximation algorithm when the input graph is sparse. On the way, we introduce the related at least $h$ (non-colored) edges densest subgraph problem, show its hardness, and also provide a linear-time constant-factor approximation. In our experiments, we demonstrate the efficacy and efficiency of our new algorithms.

Finding Densest Subgraphs with Edge-Color Constraints

TL;DR

Abstract

colored edges densest subgraph, where

is a vector of color requirements, are NP-complete, for already two colors. For the problem of finding a densest subgraph with at least

colored edges, we provide a linear-time constant-factor approximation algorithm when the input graph is sparse. On the way, we introduce the related at least

(non-colored) edges densest subgraph problem, show its hardness, and also provide a linear-time constant-factor approximation. In our experiments, we demonstrate the efficacy and efficiency of our new algorithms.

Paper Structure (16 sections, 13 theorems, 2 equations, 7 figures, 6 tables, 5 algorithms)

This paper contains 16 sections, 13 theorems, 2 equations, 7 figures, 6 tables, 5 algorithms.

Introduction
Related Work
Problem Definitions
Approximation in Sparse Graphs
Solving the at Least $h$-Edges DSP
Approximation of the at Least $\mathbf{h}\xspace$ Colored Edges DSP
Graphs with Multiple Edge Colors
Experiments
Results and Discussion
Use Case: Diverse Coauthorship
Conclusion and Future Work
Omitted Proofs
Proofs of \ref{['sec:problems']}
Proofs of \ref{['sec:approx']}
ILP Formulations
...and 1 more sections

Key Result

Theorem 1

The decision versions of the exactly, at most, and at least $\mathbf{h}\xspace$ colored edges version are $\mathbf{NP}$-complete.

Figures (7)

Figure 1: Example for the at least $\mathbf{h}\xspace$ colored edges densest subgraph problem in a toy social network with two relationship types. The subgraph induced by $S_1$ is the densest unconstrained subgraph. If we require the densest subgraph to contain at least four edges of type two (red dashed), the graph induced by $S_2$ is optimal.
Figure 2: The density computed with AtLeastHApprox for increasing numbers of required edges.
Figure 3: The distributions of colors in various data sets.
Figure 4: Comparison of the heuristic, approximation algorithm, and exact ILP.
Figure 5: Densities for increasing color requirements (the common legend is shown in (a)).
...and 2 more figures

Theorems & Definitions (15)

Definition 1
Theorem 1
Definition 2
Theorem 2
Lemma 1
Lemma 2
Theorem 3
Theorem 4
Theorem 5
Theorem 6
...and 5 more

Finding Densest Subgraphs with Edge-Color Constraints

TL;DR

Abstract

Finding Densest Subgraphs with Edge-Color Constraints

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (15)