ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making
Ziyang Guo, Yifan Wu, Jason Hartline, Kenneth Holstein, Jessica Hullman
TL;DR
ComplLLM tackles the challenge of identifying complementary information in multi-agent decision workflows by framing a decision-theoretic objective: surface signals from supervisor information that improve decision quality beyond an upstream model's recommendation. It introduces a three-stage approach—estimating the data-generating process, supervised fine-tuning with generated complementary signals, and reinforcement learning via Group Relative Policy Optimization—to elicit actionable, interpretable signals. The framework is validated across synthetic data and three real-world domains (radiology, content moderation, and scientific paper reviewing), demonstrating reliable recovery of complementary signals and improved downstream decision performance, including qualitative expert feedback in medicine. The work advances explainable AI for collaboration by shifting explanations from justification toward surfacing decision-relevant, complementary cues that decision-makers should consider in conjunction with existing recommendations.
Abstract
Multi-agent decision pipelines can outperform single agent workflows when complementarity holds, i.e., different agents bring unique information to the table to inform a final decision. We propose ComplLLM, a post-training framework based on decision theory that fine-tunes a decision-assistant LLM using complementary information as reward to output signals that complement existing agent decisions. We validate ComplLLM on synthetic and real-world tasks involving domain experts, demonstrating how the approach recovers known complementary information and produces plausible explanations of complementary signals to support downstream decision-makers.
