Table of Contents
Fetching ...

Text Embedding Inversion Security for Multilingual Language Models

Yiyi Chen, Heather Lent, Johannes Bjerva

TL;DR

This work analyzes embedding inversion security for multilingual language models in a black-box setting, extending prior English-focused studies to examine cross-lingual and multilingual vulnerabilities. It defines multilingual and cross-lingual inversion attacks, builds a Vec2Text-style attacker framework using ME5-base and MTG data, and evaluates performance across English and four additional languages. The study reveals that multilingual models can be more vulnerable under certain conditions and that defenses designed for monolingual English defenses often fail in multilingual contexts; it also introduces a simple masking defense that preserves retrieval while substantially reducing reconstruction. The findings underscore the importance of multilingual security research and provide open-source tools to spur further defenses and evaluations across diverse languages.

Abstract

Textual data is often represented as real-numbered embeddings in NLP, particularly with the popularity of large language models (LLMs) and Embeddings as a Service (EaaS). However, storing sensitive information as embeddings can be susceptible to security breaches, as research shows that text can be reconstructed from embeddings, even without knowledge of the underlying model. While defence mechanisms have been explored, these are exclusively focused on English, leaving other languages potentially exposed to attacks. This work explores LLM security through multilingual embedding inversion. We define the problem of black-box multilingual and cross-lingual inversion attacks, and explore their potential implications. Our findings suggest that multilingual LLMs may be more vulnerable to inversion attacks, in part because English-based defences may be ineffective. To alleviate this, we propose a simple masking defense effective for both monolingual and multilingual models. This study is the first to investigate multilingual inversion attacks, shedding light on the differences in attacks and defenses across monolingual and multilingual settings.

Text Embedding Inversion Security for Multilingual Language Models

TL;DR

This work analyzes embedding inversion security for multilingual language models in a black-box setting, extending prior English-focused studies to examine cross-lingual and multilingual vulnerabilities. It defines multilingual and cross-lingual inversion attacks, builds a Vec2Text-style attacker framework using ME5-base and MTG data, and evaluates performance across English and four additional languages. The study reveals that multilingual models can be more vulnerable under certain conditions and that defenses designed for monolingual English defenses often fail in multilingual contexts; it also introduces a simple masking defense that preserves retrieval while substantially reducing reconstruction. The findings underscore the importance of multilingual security research and provide open-source tools to spur further defenses and evaluations across diverse languages.

Abstract

Textual data is often represented as real-numbered embeddings in NLP, particularly with the popularity of large language models (LLMs) and Embeddings as a Service (EaaS). However, storing sensitive information as embeddings can be susceptible to security breaches, as research shows that text can be reconstructed from embeddings, even without knowledge of the underlying model. While defence mechanisms have been explored, these are exclusively focused on English, leaving other languages potentially exposed to attacks. This work explores LLM security through multilingual embedding inversion. We define the problem of black-box multilingual and cross-lingual inversion attacks, and explore their potential implications. Our findings suggest that multilingual LLMs may be more vulnerable to inversion attacks, in part because English-based defences may be ineffective. To alleviate this, we propose a simple masking defense effective for both monolingual and multilingual models. This study is the first to investigate multilingual inversion attacks, shedding light on the differences in attacks and defenses across monolingual and multilingual settings.
Paper Structure (41 sections, 3 equations, 6 figures, 18 tables)

This paper contains 41 sections, 3 equations, 6 figures, 18 tables.

Figures (6)

  • Figure 1: Schematic overview of a text embedding inversion attack. A user accesses an EaaS provider, while an attacker is eavesdropping. Although the attacker has no direct access to the embedding model, they can reliably decode the information stored in the embeddings.
  • Figure 2: Overview of Multilingual Vec2Text, extending Vec2Text morris-etal-2023-text with Ad hoc Translation and Masking Defense Mechanism (outlined in the green dashed line frame). Given access to a target embedding $e$ and query access to the embedder $\mathbf{\phi}$ via an EaaS API, the inversion model $\psi$ iteratively generates hypotheses $\hat{e}$ to attain the target. The generated text $\hat{x}$ is in German, and translated to English (AdTrans($\hat{x}$)), to be compared with the target text $x$. The masking defense serves as an effective defense against inversion attacks while preserving utility in NLP tasks such as retrieval.
  • Figure 3: Retrieval and Reconstruction performance across varying levels of noise injection with monolingual (GTR-Based) and multilingual (ME5-Based) language models on BEIR (top) and CLIRMatrix (bottom) datasets. The red dotted lines indicate the noise level at which the disparity of efficacy of defense between monolingual and monolingual embeddings emerges.
  • Figure 4: Retrieval and Reconstruction performance with masked monolingual (GTR-Based) and multilingual (ME5-Based) language models on BEIR (top) and CLIRMatrix (bottom) datasets. The red dashed lines indicate the performance drop in percentage.
  • Figure 5: The Distribution of the training data for models with the maximal token length of 32.
  • ...and 1 more figures