Misinformation Resilient Search Rankings with Webgraph-based Interventions
Peter Carragher, Evan M. Williams, Kathleen M. Carley
TL;DR
This work studies misinformation-aware search ranking by designing webgraph-based interventions that penalize unreliable domains while sparing reliable ones. It presents two intervention classes—link scheme removal and link multiplicity weighting—and validates them through small-scale regression-based simulations and large-scale PageRank analyses, augmented by Anti-TrustRank and debiasing to improve fairness. The results show meaningful reductions in traffic and ranking for unreliable domains with modest collateral impact on reliable domains, and demonstrate that interventions can be tuned and extended (e.g., multi-category seed strategies) to limit unintended effects at web scale. Overall, the paper provides a principled, scalable blueprint for enhancing the trustworthiness of search results and offers practical mitigations for potential side effects, guiding future research and potential collaborations with search engines and regulators.
Abstract
The proliferation of unreliable news domains on the internet has had wide-reaching negative impacts on society. We introduce and evaluate interventions aimed at reducing traffic to unreliable news domains from search engines while maintaining traffic to reliable domains. We build these interventions on the principles of fairness (penalize sites for what is in their control), generality (label/fact-check agnostic), targeted (increase the cost of adversarial behavior), and scalability (works at webscale). We refine our methods on small-scale webdata as a testbed and then generalize the interventions to a large-scale webgraph containing 93.9M domains and 1.6B edges. We demonstrate that our methods penalize unreliable domains far more than reliable domains in both settings and we explore multiple avenues to mitigate unintended effects on both the small-scale and large-scale webgraph experiments. These results indicate the potential of our approach to reduce the spread of misinformation and foster a more reliable online information ecosystem. This research contributes to the development of targeted strategies to enhance the trustworthiness and quality of search engine results, ultimately benefiting users and the broader digital community.
