Solidago: A Modular Collaborative Scoring Pipeline

Lê Nguyên Hoang; Romain Beylerian; Bérangère Colbois; Julien Fageot; Louis Faucon; Aidan Jungo; Alain Le Noac'h; Adrien Matissart; Oscar Villemaud

Solidago: A Modular Collaborative Scoring Pipeline

Lê Nguyên Hoang, Romain Beylerian, Bérangère Colbois, Julien Fageot, Louis Faucon, Aidan Jungo, Alain Le Noac'h, Adrien Matissart, Oscar Villemaud

TL;DR

This paper presents Solidago, an end-to-end modular pipeline to allow any community of users to collaboratively score any number of entities, and proposes a six-module decomposition that uses pretrust and peer-to-peer vouches to assign trust scores to users.

Abstract

This paper presents Solidago, an end-to-end modular pipeline to allow any community of users to collaboratively score any number of entities. Solidago proposes a six-module decomposition. First, it uses pretrust and peer-to-peer vouches to assign trust scores to users. Second, based on participation, trust scores are turned into voting rights per user per entity. Third, for each user, a preference model is learned from the user's evaluation data. Fourth, users' models are put on a similar scale. Fifth, these models are securely aggregated. Sixth, models are post-processed to yield human-readable global scores. We also propose default implementations of the six modules, including a novel trust propagation algorithm, and adaptations of state-of-the-art scaling and aggregation solutions. Our pipeline has been successfully deployed on the open-source platform tournesol.app. We thereby lay an appealing foundation for the collaborative, effective, scalable, fair, interpretable and secure scoring of any set of entities.

Solidago: A Modular Collaborative Scoring Pipeline

TL;DR

Abstract

Paper Structure (90 sections, 18 theorems, 52 equations, 10 figures)

This paper contains 90 sections, 18 theorems, 52 equations, 10 figures.

Introduction
Contributions.
Related work.
Paper structure.
Problem formulation and data
Pretrust.
Vouch data.
Comparisons.
Privacy
Problem formulation.
Trust propagation
Sink vouch
LipschiTrust
Probabilistic intuition.
Defining LipschiTrust.
...and 75 more sections

Key Result

Proposition 3.1

There is a unique vector $\textsc{trust}_{}$ such that $\left\lVert{\textsc{tr}^{t}_{} - \textsc{trust}_{}}\right\rVert_{1} \leq U \beta^t$ for all $t$. Moreover, this vector verifies $\textsc{trust}_{} = \min \left\lbrace \textsc{trust}^{pre}_{} + \beta V^T \textsc{trust}_{}, 1 \right\rbrace$.

Figures (10)

Figure 1: This figure describes the Solidago pipeline (slightly simplified). Green boxes correspond to public data, while black boxes are kept hidden. The purple boxes contain both public and private data. The pipeline is composed of 6 steps, namely (1) trust propagation, (2) voting rights assignment, (3) preference learning, (4) model scaling, (5) model aggregation and (6) post-process.
Figure 2: Tournesol's users are asked to select any two videos and to report which they would recommend more widely.
Figure 3: Correlation between the ground truth and the learned global scores, as a function of the fraction of trustworthy users (left), and as a function of engagement bias (right).
Figure 4: Distribution of displayed user scores $\theta^{\bf display}_{u e}$ (top graph) and global $\rho^{\bf display}_{e}$ (bottom graph). Hyperparameters were selected so that users' scores have roughly a similar distribution, once they were compared at least 3 times, and so that the global scores of highly compared videos are well spread. Hyperparameter selection is further discussed in Appendix \ref{['sec:section']}.
Figure 5: Impact of the prior weight $\alpha_{prior}^{user}$ on the distributions of squashed individual and global scores
...and 5 more figures

Theorems & Definitions (36)

Proposition 3.1
proof : Proof sketch
Theorem 3.2
proof : Proof sketch
Proposition 3.3
Proposition 3.4
Proposition 5.1: Theorem 2 in GBT2023
Theorem 7.1: Lipschitz resilience of QrQtl
proof : Proof sketch
Lemma A.1
...and 26 more

Solidago: A Modular Collaborative Scoring Pipeline

TL;DR

Abstract

Solidago: A Modular Collaborative Scoring Pipeline

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (36)