RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Javad Rafiei Asl; Prajwal Panzade; Eduardo Blanco; Daniel Takabi; Zhipeng Cai

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Javad Rafiei Asl, Prajwal Panzade, Eduardo Blanco, Daniel Takabi, Zhipeng Cai

TL;DR

This paper introduces RobustSentEmbed, a self-supervised sentence embedding framework designed to improve both generalization and robustness in diverse text representation tasks and against a diverse set of adversarial attacks.

Abstract

Pre-trained language models (PLMs) have consistently demonstrated outstanding performance across a diverse spectrum of natural language processing tasks. Nevertheless, despite their success with unseen data, current PLM-based representations often exhibit poor robustness in adversarial settings. In this paper, we introduce RobustSentEmbed, a self-supervised sentence embedding framework designed to improve both generalization and robustness in diverse text representation tasks and against a diverse set of adversarial attacks. Through the generation of high-risk adversarial perturbations and their utilization in a novel objective function, RobustSentEmbed adeptly learns high-quality and robust sentence embeddings. Our experiments confirm the superiority of RobustSentEmbed over state-of-the-art representations. Specifically, Our framework achieves a significant reduction in the success rate of various adversarial attacks, notably reducing the BERTAttack success rate by almost half (from 75.51\% to 38.81\%). The framework also yields improvements of 1.59\% and 0.23\% in semantic textual similarity tasks and various transfer tasks, respectively.

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

TL;DR

Abstract

Paper Structure (22 sections, 14 equations, 6 figures, 6 tables, 1 algorithm)

This paper contains 22 sections, 14 equations, 6 figures, 6 tables, 1 algorithm.

Introduction
Related Work
The Proposed Framework
Perturbation Generator
Robust Contrastive Learning
Evaluation and Experimental Results
Adversarial Attacks
Robust Embeddings
Semantic Textual Similarity (STS) Tasks
Transfer Tasks
Distribution of Sentence Embeddings
Conclusion and Future Work
Limitations
Training Details
Ablation Studies
...and 7 more sections

Figures (6)

Figure 1: The general architecture of the RobustSentEmbed framework.
Figure 2: Average number of queries and the resulting accuracy reduction for two fine-tuned embeddings.
Figure 3: $\ell_\text{align}-\ell_\text{uniform}$ plot of models based on e. Lower uniformity and alignment is better.
Figure 4: The impact of step sizes in perturbation generation on the average performance of STS tasks.
Figure 5: The impact of the step number (represented by N = K or T) in the T-step FGSM and K-step PGD methods on the averaged correlation of the STS tasks.
...and 1 more figures

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

TL;DR

Abstract

RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (6)