DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

Hong Liu; Yitong Lu

DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

Hong Liu, Yitong Lu

TL;DR

A simple yet effective method that leverages random sentences and Canonical Correlation Analysis to enrich the text embeddings of the foundation model and uses CCA double twice to align the representations and reconstruct them back to the original representation space is proposed.

Abstract

This paper presents a novel method to improve the robustness of foundation models to group-based biases. We propose a simple yet effective method, called DoubleCCA, that leverages random sentences and Canonical Correlation Analysis (CCA) to enrich the text embeddings of the foundation model. First, we generate various random sentences that augment the original prompts, which extends the original prompts with random words or character sequences. Second, we use an additional sentence embedding model to generate different text embeddings with respect to these random sentences. We then use CCA double twice to align the representations and reconstruct them back to the original representation space. We demonstrate the effectiveness of our method on a variety of tasks and datasets, showing that it outperforms existing methods in terms of both performance and robustness. Our method is simple to implement and can be easily integrated into existing models, making it a practical solution for improving the robustness of foundation models to group-based biases.

DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

TL;DR

Abstract

DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)