Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

Damien Berriaud; Roger Wattenhofer

Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

Damien Berriaud, Roger Wattenhofer

TL;DR

A theoretical framework for dealing with clone-proof weighting functions, which distribute importance across elements of a set such that similar objects share (some of) their weights, thus avoiding a potential bias introduced by their multiplicity.

Abstract

We are given a set of elements in a metric space. The distribution of the elements is arbitrary, possibly adversarial. Can we weigh the elements in a way that is resistant to such (adversarial) manipulations? This problem arises in various contexts. For instance, the elements could represent data points, requiring robust domain adaptation. Alternatively, they might represent tasks to be aggregated into a benchmark; or questions about personal political opinions in voting advice applications. This article introduces a theoretical framework for dealing with such problems. We propose clone-proof weighting functions as a solution concept. These functions distribute importance across elements of a set such that similar objects (``clones'') share (some of) their weights, thus avoiding a potential bias introduced by their multiplicity. Our framework extends the maximum uncertainty principle to accommodate general metric spaces and includes a set of axioms -- symmetry, continuity, and clone-proofness -- that guide the construction of weighting functions. Finally, we address the existence of weighting functions satisfying our axioms in the significant case of Euclidean spaces and propose a general method for their construction.

Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

TL;DR

Abstract

Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (21)