Multi-Agent Fact Checking
Ashwin Verma, Soheil Mohajer, Behrouz Touri
TL;DR
This work addresses online learning of unknown agent unreliabilities in distributed fact-checking, modeling each agent as a memoryless Binary Symmetric Channel with crossover $\pi_i$ and the statement stream as IID binary labels. It proposes a low-memory online estimator that uses a likelihood-ratio concept and log-odds weights to update unreliability estimates via a stochastic-approximation-type rule, connected to a mean-field ODE. The authors establish almost-sure convergence to an extended equilibrium set $\bar{\mathcal{E}}$ by constructing a KL-divergence Lyapunov function and extending the domain to handle boundary cases where some agents become fully reliable or unreliable. The results provide a principled, scalable approach to real-time reliability learning in crowdsourced/fact-checking settings, with rigorous convergence guarantees and insights into the role of boundary equilibria. Potential impact includes improved robustness and efficiency of automated fact-checking in platforms with multiple imperfect verifiers.
Abstract
We formulate the problem of fake news detection using distributed fact-checkers (agents) with unknown reliability. The stream of news/statements is modeled as an independent and identically distributed binary source (to represent true and false statements). Upon observing a news, agent $i$ labels the news as true or false which reflects the true validity of the statement with some probability $1-π_i$. In other words, agent $i$ misclassified each statement with error probability $π_i\in (0,1)$, where the parameter $π_i$ models the (un)trustworthiness of agent $i$. We present an algorithm to learn the unreliability parameters, resulting in a distributed fact-checking algorithm. Furthermore, we extensively analyze the discrete-time limit of our algorithm.
