FACTER: Fairness-Aware Conformal Thresholding and Prompt Engineering for Enabling Fair LLM-Based Recommender Systems
Arya Fayyazi, Mehdi Kamal, Massoud Pedram
TL;DR
FACTER proposes a post hoc framework that unites conformal prediction with iterative prompt engineering to mitigate biases in black-box LLM-based recommender systems. It defines a fairness-aware non-conformity score that blends predictive discrepancy with cross-group embedding dissimilarities and uses offline calibration to set a robust threshold, updated online as violations occur. By injecting explicit bias-avoidance patterns into the system prompt and adjusting the threshold adaptively, FACTER achieves substantial reductions in fairness violations (up to 95.5%) on MovieLens-1M and Amazon datasets while preserving competitive recommendation accuracy. The approach requires no retraining and provides statistical guarantees, offering a scalable, black-box-friendly path to fair LLM-driven recommendations with practical impact for real-world deployment.
Abstract
We propose FACTER, a fairness-aware framework for LLM-based recommendation systems that integrates conformal prediction with dynamic prompt engineering. By introducing an adaptive semantic variance threshold and a violation-triggered mechanism, FACTER automatically tightens fairness constraints whenever biased patterns emerge. We further develop an adversarial prompt generator that leverages historical violations to reduce repeated demographic biases without retraining the LLM. Empirical results on MovieLens and Amazon show that FACTER substantially reduces fairness violations (up to 95.5%) while maintaining strong recommendation accuracy, revealing semantic variance as a potent proxy of bias.
