Table of Contents
Fetching ...

Interactive Trimming against Evasive Online Data Manipulation Attacks: A Game-Theoretic Approach

Yue Fu, Qingqing Ye, Rong Du, Haibo Hu

TL;DR

An interactive game-theoretical model to defend online data manipulation attacks using the trimming strategy and two strategies are devised, namely, Tit-for-tat and Elastic, which are applicable to strong evasive and colluding adversaries.

Abstract

With the exponential growth of data and its crucial impact on our lives and decision-making, the integrity of data has become a significant concern. Malicious data poisoning attacks, where false values are injected into the data, can disrupt machine learning processes and lead to severe consequences. To mitigate these attacks, distance-based defenses, such as trimming, have been proposed, but they can be easily evaded by white-box attackers. The evasiveness and effectiveness of poisoning attack strategies are two sides of the same coin, making game theory a promising approach. However, existing game-theoretical models often overlook the complexities of online data poisoning attacks, where strategies must adapt to the dynamic process of data collection. In this paper, we present an interactive game-theoretical model to defend online data manipulation attacks using the trimming strategy. Our model accommodates a complete strategy space, making it applicable to strong evasive and colluding adversaries. Leveraging the principle of least action and the Euler-Lagrange equation from theoretical physics, we derive an analytical model for the game-theoretic process. To demonstrate its practical usage, we present a case study in a privacy-preserving data collection system under local differential privacy where a non-deterministic utility function is adopted. Two strategies are devised from this analytical model, namely, Tit-for-tat and Elastic. We conduct extensive experiments on real-world datasets, which showcase the effectiveness and accuracy of these two strategies.

Interactive Trimming against Evasive Online Data Manipulation Attacks: A Game-Theoretic Approach

TL;DR

An interactive game-theoretical model to defend online data manipulation attacks using the trimming strategy and two strategies are devised, namely, Tit-for-tat and Elastic, which are applicable to strong evasive and colluding adversaries.

Abstract

With the exponential growth of data and its crucial impact on our lives and decision-making, the integrity of data has become a significant concern. Malicious data poisoning attacks, where false values are injected into the data, can disrupt machine learning processes and lead to severe consequences. To mitigate these attacks, distance-based defenses, such as trimming, have been proposed, but they can be easily evaded by white-box attackers. The evasiveness and effectiveness of poisoning attack strategies are two sides of the same coin, making game theory a promising approach. However, existing game-theoretical models often overlook the complexities of online data poisoning attacks, where strategies must adapt to the dynamic process of data collection. In this paper, we present an interactive game-theoretical model to defend online data manipulation attacks using the trimming strategy. Our model accommodates a complete strategy space, making it applicable to strong evasive and colluding adversaries. Leveraging the principle of least action and the Euler-Lagrange equation from theoretical physics, we derive an analytical model for the game-theoretic process. To demonstrate its practical usage, we present a case study in a privacy-preserving data collection system under local differential privacy where a non-deterministic utility function is adopted. Two strategies are devised from this analytical model, namely, Tit-for-tat and Elastic. We conduct extensive experiments on real-world datasets, which showcase the effectiveness and accuracy of these two strategies.
Paper Structure (27 sections, 15 equations, 9 figures, 4 tables, 2 algorithms)

This paper contains 27 sections, 15 equations, 9 figures, 4 tables, 2 algorithms.

Figures (9)

  • Figure 1: The definition of $x_L$, and arbitrary poison value distributions represented by a mixed strategy point
  • Figure 2: Definition of $x_L$ and $x_R$ for a single poison value
  • Figure 3: An overview of the infinite game
  • Figure 4: K-means clustering results over Control, Vehicle, and Letter, Tth=0.9
  • Figure 5: K-means clustering results over Control, Vehicle, and Letter, Tth=0.97
  • ...and 4 more figures

Theorems & Definitions (4)

  • proof
  • proof
  • proof
  • proof