Table of Contents
Fetching ...

A2XP: Towards Private Domain Generalization

Geunhyeok Yu, Hyoseok Hwang

TL;DR

A2XP reframes domain generalization as a direction regression problem that preserves network privacy by decoupling the process into Expert Adaptation and Attention-based Generalization. Per-source-domain prompts are optimized in Expert Adaptation, and a target-domain prompt is formed through a learned linear combination fused by cross-attention via two lightweight embedder networks in Attention-based Generalization. Empirical results on PACS and VLCS demonstrate state-of-the-art performance among non-private DG methods with fewer updates, along with thorough ablations showing the importance of prompt normalization, initialization, and the attention-based fusion. The approach offers a practical, privacy-conscious pathway to robust DG in computer vision, with potential extensions to broader modalities and tasks.

Abstract

Deep Neural Networks (DNNs) have become pivotal in various fields, especially in computer vision, outperforming previous methodologies. A critical challenge in their deployment is the bias inherent in data across different domains, such as image style and environmental conditions, leading to domain gaps. This necessitates techniques for learning general representations from biased training data, known as domain generalization. This paper presents Attend to eXpert Prompts (A2XP), a novel approach for domain generalization that preserves the privacy and integrity of the network architecture. A2XP consists of two phases: Expert Adaptation and Domain Generalization. In the first phase, prompts for each source domain are optimized to guide the model towards the optimal direction. In the second phase, two embedder networks are trained to effectively amalgamate these expert prompts, aiming for an optimal output. Our extensive experiments demonstrate that A2XP achieves state-of-the-art results over existing non-private domain generalization methods. The experimental results validate that the proposed approach not only tackles the domain generalization challenge in DNNs but also offers a privacy-preserving, efficient solution to the broader field of computer vision.

A2XP: Towards Private Domain Generalization

TL;DR

A2XP reframes domain generalization as a direction regression problem that preserves network privacy by decoupling the process into Expert Adaptation and Attention-based Generalization. Per-source-domain prompts are optimized in Expert Adaptation, and a target-domain prompt is formed through a learned linear combination fused by cross-attention via two lightweight embedder networks in Attention-based Generalization. Empirical results on PACS and VLCS demonstrate state-of-the-art performance among non-private DG methods with fewer updates, along with thorough ablations showing the importance of prompt normalization, initialization, and the attention-based fusion. The approach offers a practical, privacy-conscious pathway to robust DG in computer vision, with potential extensions to broader modalities and tasks.

Abstract

Deep Neural Networks (DNNs) have become pivotal in various fields, especially in computer vision, outperforming previous methodologies. A critical challenge in their deployment is the bias inherent in data across different domains, such as image style and environmental conditions, leading to domain gaps. This necessitates techniques for learning general representations from biased training data, known as domain generalization. This paper presents Attend to eXpert Prompts (A2XP), a novel approach for domain generalization that preserves the privacy and integrity of the network architecture. A2XP consists of two phases: Expert Adaptation and Domain Generalization. In the first phase, prompts for each source domain are optimized to guide the model towards the optimal direction. In the second phase, two embedder networks are trained to effectively amalgamate these expert prompts, aiming for an optimal output. Our extensive experiments demonstrate that A2XP achieves state-of-the-art results over existing non-private domain generalization methods. The experimental results validate that the proposed approach not only tackles the domain generalization challenge in DNNs but also offers a privacy-preserving, efficient solution to the broader field of computer vision.
Paper Structure (21 sections, 6 equations, 6 figures, 4 tables, 1 algorithm)

This paper contains 21 sections, 6 equations, 6 figures, 4 tables, 1 algorithm.

Figures (6)

  • Figure 1: Flow diagram of the proposed method A2XP.
  • Figure 2: Inference procedure of A2XP. There are experts from source domains and target images of an unseen target domain. The experts are image-dependently mixed through an attention-based algorithm and added to the specific image.
  • Figure 3: Geometric concept of A2XP as a linear combination in 2D manifold space with two source domains.
  • Figure 4: Comparison of two generalization strategies about fixing or tuning the experts in the generalization step.
  • Figure 5: Activation visualization of A2XP using Grad-CAM grad_cam. (a) shows the input image, (c) and (d) show the relative gain and loss of activation using A2XP prompts in (b), respectively.
  • ...and 1 more figures