GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning

Ying Zhu; Shengchang Li; Ziqian Kong; Qiang Yang; Peilan Xu

GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning

Ying Zhu, Shengchang Li, Ziqian Kong, Qiang Yang, Peilan Xu

TL;DR

GRATR addresses trustworthiness reasoning in incomplete-information settings by combining a dynamically updated trustworthiness graph with multi-hop retrieval to augment LLM reasoning in a zero-shot manner. The framework initializes and maintains a graph where nodes represent players and edges encode observed interactions and trust levels, updating these relations as new observations arrive. Through forward retrieval, backward updates, and reasoning, GRATR aggregates evidence chains from trusted sources to refine trust assessments toward a target and informs LLM-driven decisions with transparent, time-stamped rationale. Empirical results in the Werewolf game show GRATR outperforms baselines in win rate and reduces hallucinations, while a Twitter intent analysis benchmark demonstrates superior accuracy and macro F1, indicating robust applicability to real-world, language-rich domains.

Abstract

Trustworthiness reasoning aims to enable agents in multiplayer games with incomplete information to identify potential allies and adversaries, thereby enhancing decision-making. In this paper, we introduce the graph retrieval-augmented trustworthiness reasoning (GRATR) framework, which retrieves observable evidence from the game environment to inform decision-making by large language models (LLMs) without requiring additional training, making it a zero-shot approach. Within the GRATR framework, agents first observe the actions of other players and evaluate the resulting shifts in inter-player trust, constructing a corresponding trustworthiness graph. During decision-making, the agent performs multi-hop retrieval to evaluate trustworthiness toward a specific target, where evidence chains are retrieved from multiple trusted sources to form a comprehensive assessment. Experiments in the multiplayer game \emph{Werewolf} demonstrate that GRATR outperforms the alternatives, improving reasoning accuracy by 50.5\% and reducing hallucination by 30.6\% compared to the baseline method. Additionally, when tested on a dataset of Twitter tweets during the U.S. election period, GRATR surpasses the baseline method by 10.4\% in accuracy, highlighting its potential in real-world applications such as intent analysis.

GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning

TL;DR

Abstract

Paper Structure (19 sections, 9 equations, 6 figures, 4 tables, 2 algorithms)

This paper contains 19 sections, 9 equations, 6 figures, 4 tables, 2 algorithms.

Introduction
Preliminary
Methodology
Initialization of the Trustworthiness Graph
Update of the Trustworthiness Graph
Evidence Merging
Graph Retrieval Augmented Reasoning
Forward Retrieval
Backward Update
Reasoning
Experiments
Experiment on Werewolf Game
Win Rate Analysis
Action Scores
Hallucination Detection
...and 4 more sections

Figures (6)

Figure 1: Illustration of trustworthiness reasoning. Agent observes the actions of other players to gather evidence, and then evaluates inter-player trust and informs decision-making.
Figure 2: The overall framework of GRATR: Step 1. An agent participates in the game as player 1, and initializes a trustworthiness graph $G$. Step 2. When player 1 receives a new observation $o^t_1(p_3)$ following an action by $a^t_3$ at time $t$, it uses an LLM to extract the action into new evidence and its credibility and then updates and merges evidence on the graph $G^t$. Step 3. Player 1 obtains multiple evidence chains by multi-hop retrieval and updates the trustworthiness of player 4. Step 4. Update the trustworthiness of player 4 towards player 2 and player 3.
Figure 3: GRATR vs. baseline.
Figure 4: GRATR vs. NativeRAG.
Figure 5: GRATR vs. RerankRAG.
...and 1 more figures

GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning

TL;DR

Abstract

GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning

Authors

TL;DR

Abstract

Table of Contents

Figures (6)