Modeling Hypergraph Using Large Language Models

Bingqiao Gu, Jiale Zeng, Xingqin Qi, Dong Li

TL;DR

This work tackles the scarcity of large-scale real hypergraph data by introducing HyperLLM, an LLM-driven hypergraph generator that uses a two-phase framework and a four-agent collaboration to synthesize realistic, semantically coherent hypergraphs. By grounding generation in eight universal real-world patterns and a microscopic high-order preferential attachment model, HyperLLM achieves high fidelity with minimal prior information. The method combines construction-phase iterative generation with a multi-agent evolution phase guided by high-order prompts, enabling efficient yet rich hypergraph synthesis. Empirical results on eight datasets demonstrate superior structural and dynamic pattern alignment, suggesting LLM-based, agent-driven frameworks as a promising direction for scalable, realistic hypergraph modeling.

Abstract

Because hypergraphs are well suited to modeling high-order relationships in complex systems, they have been applied to higher-order clustering, hypergraph neural networks, and computer vision. These applications rely heavily on access to high-quality, large-scale real-world hypergraph data. Yet, compared to traditional pairwise graphs, real hypergraph datasets remain scarce in both scale and diversity. This shortage significantly limits the development and evaluation of advanced hypergraph learning algorithms. Quickly generating large-scale hypergraphs that conform to the characteristics of real networks is therefore a crucial task that has not received sufficient attention. Motivated by recent advances in large language models (LLMs), particularly their capabilities in semantic reasoning, structured generation, and simulating human behavior, we investigate whether LLMs can facilitate hypergraph generation from a fundamentally new perspective. We introduce HyperLLM, a novel LLM-driven hypergraph generator that simulates the formation and evolution of hypergraphs through multi-agent collaboration. The framework integrates prompts and structural feedback mechanisms to ensure that the generated hypergraphs reflect key real-world patterns. Extensive experiments across diverse datasets demonstrate that HyperLLM achieves superior fidelity to structural and temporal hypergraph patterns while requiring minimal statistical priors. Our findings suggest that LLM-based frameworks offer a promising new direction for hypergraph modeling.

Paper Structure

This paper contains 16 sections, 9 equations, 5 figures, 2 tables, 2 algorithms.

Figures (5)

  • Figure 1: HyperLLM generates realistic hypergraphs. The top row displays seven structural and dynamic patterns from a real-world email hypergraph. The bottom row shows that our model, HyperLLM, successfully reproduces all seven patterns.
  • Figure 2: The schematic diagram of the research motivation of this article.
  • Figure 3: The core mechanism of multi-agent collaborative hypergraph generation.
  • Figure 4: Comparison between a real hypergraph and those generated by HyperLLM with different attachment probabilities. The results, particularly for preferential attachment probability 0.85, closely mimic the real data patterns.
  • Figure 5: Hyperparameter sensitivity analysis of HyperLLM. The plots show performance on four datasets: (a) Email-Eu-core, (b) coauth-Geo, (c) DAWN, and (d) NDC-substances. The z-axis shows the goodness of fit $\gamma$ (defined in Eq. \ref{eq:gamma}); higher is better.