Trustworthy Machine Learning under Social and Adversarial Data Sources
Han Shao
TL;DR
This work develops a theoretical foundation for trustworthy machine learning when data come from social and adversarial sources. It provides a spectrum of results across strategic classification, federated incentives, active learning, multi-objective decision making, and robust learning under clean-label attacks, linking online and PAC learnability to manipulation power and information structure. Core contributions include logarithmic vs linear bounds for strategic learning under ball manipulations, the design of incentive-aware and stable-policy mechanisms in federated and collaborative contexts, and efficient algorithms for unknown-manipulation-graph and multi-objective learning with comparative feedback. The findings illuminate when learnability transfers under strategic behavior, propose practical algorithms (e.g., Strategic Halving, ADA-GD, stable-policy oracles), and establish fundamental limits and open problems for trustworthy ML in societally impactful data settings.
Abstract
Machine learning has witnessed remarkable breakthroughs in recent years. As machine learning permeates various aspects of daily life, individuals and organizations increasingly interact with these systems, exhibiting a wide range of social and adversarial behaviors. These behaviors may have a notable impact on the behavior and performance of machine learning systems. Specifically, during these interactions, data may be generated by strategic individuals, collected by self-interested data collectors, possibly poisoned by adversarial attackers, and used to create predictors, models, and policies satisfying multiple objectives. As a result, the machine learning systems' outputs might degrade, such as the susceptibility of deep neural networks to adversarial examples (Shafahi et al., 2018; Szegedy et al., 2013) and the diminished performance of classic algorithms in the presence of strategic individuals (Ahmadi et al., 2021). Addressing these challenges is imperative for the success of machine learning in societal settings.
