Citations and Trust in LLM Generated Responses
Yifan Ding, Matthew Facciani, Amrit Poudel, Ellen Joyce, Salvador Aguinaga, Balaji Veeramani, Sanmitra Bhattacharya, Tim Weninger
TL;DR
This paper investigates how citations influence user trust in large language model (LLM) responses through an anti-monitoring framework and social proof. It implements a live QA experiment where ChatGPT-4 outputs are shown with zero, one, or five citations that are either valid or random, measuring self-reported trust and whether participants check citations. The findings show that citations generally boost trust, even when random, while actively checking citations reduces trust, with no clear advantage to using more than a single citation. The work also analyzes question type and demographics, revealing that political and fact-based questions tend to be rated more trustworthy, and that citation checking behavior varies across groups. These insights have practical implications for designing Retrieval Augmented Generation systems and user interfaces that balance transparency with perceived reliability.
Abstract
Question answering systems are rapidly advancing, but their opaque nature may impact user trust. We explored trust through an anti-monitoring framework, where trust is predicted to be correlated with presence of citations and inversely related to checking citations. We tested this hypothesis with a live question-answering experiment that presented text responses generated using a commercial Chatbot along with varying citations (zero, one, or five), both relevant and random, and recorded if participants checked the citations and their self-reported trust in the generated responses. We found a significant increase in trust when citations were present, a result that held true even when the citations were random; we also found a significant decrease in trust when participants checked the citations. These results highlight the importance of citations in enhancing trust in AI-generated content.
