Table of Contents
Fetching ...

Accelerating Maximum Common Subgraph Computation by Exploiting Symmetries

Buddhi Kothalawala, Henning Koehler, Muhammad Farhan

Abstract

The Maximum Common Subgraph (MCS) problem plays a key role in many applications, including cheminformatics, bioinformatics, and pattern recognition, where it is used to identify the largest shared substructure between two graphs. Although symmetry exploitation is a powerful means of reducing search space in combinatorial optimization, its potential in MCS algorithms has remained largely underexplored due to the challenges of detecting and integrating symmetries effectively. Existing approaches, such as RRSplit, partially address symmetry through vertex-equivalence reasoning on the variable graph, but symmetries in the value graph remain unexploited. In this work, we introduce a complete dual-symmetry breaking framework that simultaneously handles symmetries in both variable and value graphs. Our method identifies and exploits modular symmetries based on local neighborhood structures, allowing the algorithm to prune isomorphic subtrees during search while rigorously preserving optimality. Extensive experiments on standard MCS benchmarks show that our approach substantially outperforms the state-of-the-art RRSplit algorithm, solving more instances with significant reductions in both computation time and search space. These results highlight the practical effectiveness of comprehensive symmetry-aware pruning for accelerating exact MCS computation.

Accelerating Maximum Common Subgraph Computation by Exploiting Symmetries

Abstract

The Maximum Common Subgraph (MCS) problem plays a key role in many applications, including cheminformatics, bioinformatics, and pattern recognition, where it is used to identify the largest shared substructure between two graphs. Although symmetry exploitation is a powerful means of reducing search space in combinatorial optimization, its potential in MCS algorithms has remained largely underexplored due to the challenges of detecting and integrating symmetries effectively. Existing approaches, such as RRSplit, partially address symmetry through vertex-equivalence reasoning on the variable graph, but symmetries in the value graph remain unexploited. In this work, we introduce a complete dual-symmetry breaking framework that simultaneously handles symmetries in both variable and value graphs. Our method identifies and exploits modular symmetries based on local neighborhood structures, allowing the algorithm to prune isomorphic subtrees during search while rigorously preserving optimality. Extensive experiments on standard MCS benchmarks show that our approach substantially outperforms the state-of-the-art RRSplit algorithm, solving more instances with significant reductions in both computation time and search space. These results highlight the practical effectiveness of comprehensive symmetry-aware pruning for accelerating exact MCS computation.
Paper Structure (20 sections, 6 theorems, 8 equations, 11 figures, 5 tables, 1 algorithm)

This paper contains 20 sections, 6 theorems, 8 equations, 11 figures, 5 tables, 1 algorithm.

Key Result

Theorem 3.1

A vertex $v$ cannot participate in both negative and positive symmetry relationships.

Figures (11)

  • Figure 1: Two example graphs $G$ and $H$, with symmetric vertex sets $\{4, 5\}$, $\{7, 9\}$, and $\{d, e\}$. The pairs $(4, 5)$ and $(7, 9)$ exhibit negative symmetry, while $(d, e)$ exhibit positive symmetry. The maximum common induced subgraph (MCIS) is shown in bold, with the vertex mapping $M = \{(0, a), (1, b), (2, d), (3, g), (4, c), (6, j), (7, i)\}$.
  • Figure 2: An illustration of a partial MCIS search tree demonstrating the proposed symmetry breaking rule. Grey nodes represent subproblems that are isomorphic to their left siblings and are pruned using either value-symmetry or variable-symmetry. The left box shows pruning based on H-symmetry, while the right box shows pruning based on variable-symmetry.
  • Figure 3: Comparing completion times across all datasets for instances solved by at least one method.
  • Figure 4: Cumulative distribution of solved instances across all datasets with respect to completion time.
  • Figure 5: Comparing branches across all datasets for instances solved by both methods.
  • ...and 6 more figures

Theorems & Definitions (20)

  • Definition 2.1: Maximum Common Induced Subgraph
  • Example 2.1
  • Example 2.2
  • Definition 3.1: Properties of MCIS Algorithms
  • Definition 3.2: Neighborhood
  • Definition 3.3: Modular Symmetry
  • Theorem 3.1: Mutual Exclusiveness
  • Example 3.1
  • Theorem 3.2
  • Definition 3.4: Value-Lexicographical Order
  • ...and 10 more