ElicitationGPT: Text Elicitation Mechanisms via Language Models
Yifan Wu, Jason Hartline
TL;DR
This work develops ElicitationGPT, a textual information elicitation framework that reduces open-ended text to a numerical forecast using domain-knowledge-free LLM queries (summarization and QA) and proper scoring rules. The authors prove conditions for (approximate) properness and demonstrate adversarial robustness, applying the approach to peer grading data where ground-truth alignment with instructor scores and overall student performance is strong. By treating LLMs as oracle-based components within an algorithmic AI paradigm, the paper provides guarantees beyond direct prompting and shows text-based scoring can outperform traditional numeric rubrics in capturing true performance. The methodology offers a scalable, guarantee-bearing avenue for high-quality textual data elicitation with broad potential applications beyond education.
Abstract
Scoring rules evaluate probabilistic forecasts of an unknown state against the realized state and are a fundamental building block in the incentivized elicitation of information. This paper develops mechanisms for scoring elicited text against ground truth text by reducing the textual information elicitation problem to a forecast elicitation problem, via domain-knowledge-free queries to a large language model (specifically ChatGPT), and empirically evaluates their alignment with human preferences. Our theoretical analysis shows that the reduction achieves provable properness via black-box language models. The empirical evaluation is conducted on peer reviews from a peer-grading dataset, in comparison to manual instructor scores for the peer reviews. Our results suggest a paradigm of algorithmic artificial intelligence that may be useful for developing artificial intelligence technologies with provable guarantees.
