TaGAT: Topology-Aware Graph Attention Network For Multi-modal Retinal Image Fusion

Xin Tian; Nantheera Anantrasirichai; Lindsay Nicholson; Alin Achim

TaGAT: Topology-Aware Graph Attention Network For Multi-modal Retinal Image Fusion

Xin Tian, Nantheera Anantrasirichai, Lindsay Nicholson, Alin Achim

TL;DR

TaGAT introduces a topology-aware Graph Attention Network for multi-modal retinal image fusion, bridging Euclidean image features with a non-Euclidean retinal vessel graph via a Graph Attention-based Topology-Aware Encoder. It combines a Long-short Range (LSR) encoder, a Spatial-to-Graph (S2G) mapping, a GAT-based Graph Information Update (GIU), and a Graph-to-Spatial (G2S) diffusion to produce a fused image, trained in two stages with specialized losses that preserve structure and texture. Across DRFF and OCT2Confocal datasets, TaGAT achieves state-of-the-art or competitive fusion metrics, with strong preservation of fine vasculature, optic disc details, and reduced border artefacts, validating the effectiveness of incorporating vascular topology into fusion. The framework offers a principled, topology-aware approach that can generalize across modalities and potentially enhance downstream ophthalmic analyses such as vessel segmentation and disease monitoring.

Abstract

In the realm of medical image fusion, integrating information from various modalities is crucial for improving diagnostics and treatment planning, especially in retinal health, where the important features exhibit differently in different imaging modalities. Existing deep learning-based approaches insufficiently focus on retinal image fusion, and thus fail to preserve enough anatomical structure and fine vessel details in retinal image fusion. To address this, we propose the Topology-Aware Graph Attention Network (TaGAT) for multi-modal retinal image fusion, leveraging a novel Topology-Aware Encoder (TAE) with Graph Attention Networks (GAT) to effectively enhance spatial features with retinal vasculature's graph topology across modalities. The TAE encodes the base and detail features, extracted via a Long-short Range (LSR) encoder from retinal images, into the graph extracted from the retinal vessel. Within the TAE, the GAT-based Graph Information Update (GIU) block dynamically refines and aggregates the node features to generate topology-aware graph features. The updated graph features with base and detail features are combined and decoded as a fused image. Our model outperforms state-of-the-art methods in Fluorescein Fundus Angiography (FFA) with Color Fundus (CF) and Optical Coherence Tomography (OCT) with confocal microscopy retinal image fusion. The source code can be accessed via https://github.com/xintian-99/TaGAT.

TaGAT: Topology-Aware Graph Attention Network For Multi-modal Retinal Image Fusion

TL;DR

Abstract

TaGAT: Topology-Aware Graph Attention Network For Multi-modal Retinal Image Fusion

Authors

TL;DR

Abstract

Table of Contents

Figures (2)