OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

Chenxi Wan; Xunkai Li; Yilong Zuo; Haokun Deng; Sihan Li; Bowen Fan; Hongchao Qin; Ronghua Li; Guoren Wang

OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

Chenxi Wan, Xunkai Li, Yilong Zuo, Haokun Deng, Sihan Li, Bowen Fan, Hongchao Qin, Ronghua Li, Guoren Wang

TL;DR

OpenMAG formalizes Multimodal-Attributed Graphs (MAGs) and presents a comprehensive benchmark to standardize evaluation across datasets, encoders, models, and tasks. It defines MAG as $\mathcal{G}= (\mathcal{V},\mathcal{E},\mathcal{X}_{\mathcal{M}})$ with modalities $\mathcal{M}=\{T,V\}$ and a four-stage pipeline (encoding, fusion, interaction, adaptation), and evaluates 8 downstream tasks using diverse metrics. The benchmark integrates 19 datasets over 6 domains, supports 16 encoders and 24 MAG models across Graph-, Multimodal-, and MLLM-enhanced paradigms, and scrutinizes five evaluation dimensions to yield 14 actionable insights. This open-source framework enables fair, large-scale comparisons and guides future MAG research toward unified, robust, and scalable solutions with practical impact in cross-modal reasoning and generation on graphs.

Abstract

Multimodal-Attributed Graph (MAG) learning has achieved remarkable success in modeling complex real-world systems by integrating graph topology with rich attributes from multiple modalities. With the rapid proliferation of novel MAG models capable of handling intricate cross-modal semantics and structural dependencies, establishing a rigorous and unified evaluation standard has become imperative. Although existing benchmarks have facilitated initial progress, they exhibit critical limitations in domain coverage, encoder flexibility, model diversity, and task scope, presenting significant challenges to fair evaluation. To bridge this gap, we present OpenMAG, a comprehensive benchmark that integrates 19 datasets across 6 domains and incorporates 16 encoders to support both static and trainable feature encoding. OpenMAG further implements a standardized library of 24 state-of-the-art models and supports 8 downstream tasks, enabling fair comparisons within a unified framework. Through systematic assessment of necessity, data quality, effectiveness, robustness, and efficiency, we derive 14 fundamental insights into MAG learning to guide future advancements. Our code is available at https://github.com/YUKI-N810/OpenMAG.

OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

TL;DR

OpenMAG formalizes Multimodal-Attributed Graphs (MAGs) and presents a comprehensive benchmark to standardize evaluation across datasets, encoders, models, and tasks. It defines MAG as

with modalities

and a four-stage pipeline (encoding, fusion, interaction, adaptation), and evaluates 8 downstream tasks using diverse metrics. The benchmark integrates 19 datasets over 6 domains, supports 16 encoders and 24 MAG models across Graph-, Multimodal-, and MLLM-enhanced paradigms, and scrutinizes five evaluation dimensions to yield 14 actionable insights. This open-source framework enables fair, large-scale comparisons and guides future MAG research toward unified, robust, and scalable solutions with practical impact in cross-modal reasoning and generation on graphs.

Abstract

Paper Structure (33 sections, 11 equations, 7 figures, 8 tables)

This paper contains 33 sections, 11 equations, 7 figures, 8 tables.

INTRODUCTION
Preliminaries
Notations and Learning Paradigms
Problem Formulation
BENCHMARK DESIGN
Dataset Overview For OpenMAG
Encoder Module For OpenMAG
Model Library For OpenMAG
Task Evaluation For OpenMAG
EXPERIMENTS AND ANALYSIS
Validation of Structure-Modality Necessity
Assessment of Multimodal Feature Quality
Effectiveness Analysis
Robustness Analysis
Complexity and Practical Efficiency Analysis
...and 18 more sections

Figures (7)

Figure 1: The overall framework of OpenMAG.
Figure 2: Performance comparison on unimodal and multimodal settings.
Figure 3: Impact of fine-tuning strategies.
Figure 4: Comparison of encoding strategies.
Figure 5: Robustness analysis on different models.
...and 2 more figures

OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

TL;DR

Abstract

OpenMAG: A Comprehensive Benchmark for Multimodal-Attributed Graph

Authors

TL;DR

Abstract

Table of Contents

Figures (7)