ValuePilot: A Two-Phase Framework for Value-Driven Decision-Making
Yitong Luo, Hou Hei Lam, Ziang Chen, Zhenliang Zhang, Xue Feng
TL;DR
ValuePilot tackles personalized, value-driven decision-making by introducing a two-phase framework: a dataset generation toolkit (DGT) that constructs value-annotated scenarios and actions via LLMs with automated filtering and human curation, and a decision-making module (DMM) that learns to recognize values and select actions. The DMM comprises a Value Assessment Network, built on a T5 encoder, and an Action Selection Module that combines objective value signals with user-specific preferences through Contextualized Scoring and PROMETHEE-based ranking. Across in-depth experiments, ValuePilot demonstrates strong alignment with human value preferences, outperforming open-source LLMs in value recognition and surpassing state-of-the-art LLMs in replicating human decision sequences and first-choice actions, aided by explicit merit of the PROMETHEE MCDM approach. The framework presents a scalable path toward interpretable, human-aligned AI that can generalize to novel scenarios while remaining sensitive to individual value profiles, albeit with limitations related to synthetic data, cultural variability, and scalability of value dimensions.
Abstract
Despite recent advances in artificial intelligence (AI), it poses challenges to ensure personalized decision-making in tasks that are not considered in training datasets. To address this issue, we propose ValuePilot, a two-phase value-driven decision-making framework comprising a dataset generation toolkit DGT and a decision-making module DMM trained on the generated data. DGT is capable of generating scenarios based on value dimensions and closely mirroring real-world tasks, with automated filtering techniques and human curation to ensure the validity of the dataset. In the generated dataset, DMM learns to recognize the inherent values of scenarios, computes action feasibility and navigates the trade-offs between multiple value dimensions to make personalized decisions. Extensive experiments demonstrate that, given human value preferences, our DMM most closely aligns with human decisions, outperforming Claude-3.5-Sonnet, Gemini-2-flash, Llama-3.1-405b and GPT-4o. This research is a preliminary exploration of value-driven decision-making. We hope it will stimulate interest in value-driven decision-making and personalized decision-making within the community.
