Table of Contents
Fetching ...

Boundless Across Domains: A New Paradigm of Adaptive Feature and Cross-Attention for Domain Generalization in Medical Image Segmentation

Yuheng Xu, Taiping Zhang

TL;DR

An Adaptive Feature Blending method is proposed that generates out-of-distribution samples while exploring the in-distribution space, significantly expanding the domain range and achieving superior performance on standard domain generalization benchmarks for medical image segmentation.

Abstract

Domain-invariant representation learning is a powerful method for domain generalization. Previous approaches face challenges such as high computational demands, training instability, and limited effectiveness with high-dimensional data, potentially leading to the loss of valuable features. To address these issues, we hypothesize that an ideal generalized representation should exhibit similar pattern responses within the same channel across cross-domain images. Based on this hypothesis, we use deep features from the source domain as queries, and deep features from the generated domain as keys and values. Through a cross-channel attention mechanism, the original deep features are reconstructed into robust regularization representations, forming an explicit constraint that guides the model to learn domain-invariant representations. Additionally, style augmentation is another common method. However, existing methods typically generate new styles through convex combinations of source domains, which limits the diversity of training samples by confining the generated styles to the original distribution. To overcome this limitation, we propose an Adaptive Feature Blending (AFB) method that generates out-of-distribution samples while exploring the in-distribution space, significantly expanding the domain range. Extensive experimental results demonstrate that our proposed methods achieve superior performance on two standard domain generalization benchmarks for medical image segmentation.

Boundless Across Domains: A New Paradigm of Adaptive Feature and Cross-Attention for Domain Generalization in Medical Image Segmentation

TL;DR

An Adaptive Feature Blending method is proposed that generates out-of-distribution samples while exploring the in-distribution space, significantly expanding the domain range and achieving superior performance on standard domain generalization benchmarks for medical image segmentation.

Abstract

Domain-invariant representation learning is a powerful method for domain generalization. Previous approaches face challenges such as high computational demands, training instability, and limited effectiveness with high-dimensional data, potentially leading to the loss of valuable features. To address these issues, we hypothesize that an ideal generalized representation should exhibit similar pattern responses within the same channel across cross-domain images. Based on this hypothesis, we use deep features from the source domain as queries, and deep features from the generated domain as keys and values. Through a cross-channel attention mechanism, the original deep features are reconstructed into robust regularization representations, forming an explicit constraint that guides the model to learn domain-invariant representations. Additionally, style augmentation is another common method. However, existing methods typically generate new styles through convex combinations of source domains, which limits the diversity of training samples by confining the generated styles to the original distribution. To overcome this limitation, we propose an Adaptive Feature Blending (AFB) method that generates out-of-distribution samples while exploring the in-distribution space, significantly expanding the domain range. Extensive experimental results demonstrate that our proposed methods achieve superior performance on two standard domain generalization benchmarks for medical image segmentation.

Paper Structure

This paper contains 11 sections, 5 equations, 3 figures, 3 tables.

Figures (3)

  • Figure 1: Visualization of synthetic feature statistics samples using MixStyle, FDA, and our proposed Adaptive Feature Blending (AFB) method.
  • Figure 2: Overview of our proposed method. AFB generates out-of-distribution samples while exploring the in-distribution space, significantly expanding the domain range. DCAR reconstructs the original deep features and the generated deep features into robust regularization constraints through a cross-attention mechanism, thereby guiding the model to learn domain-invariant representations.
  • Figure 3: Visual comparison for Fundus and Prostate segmentation task. The red contours indicate the boundaries of ground truths while the green and blue contours are predictions.