Be.FM: Open Foundation Models for Human Behavior
Yutong Xie, Zhuoheng Li, Xiyuan Wang, Yijun Pan, Qijia Liu, Xingzhi Cui, Kuang-Yu Lo, Ruoyi Gao, Xingjian Zhang, Jin Huang, Walter Yuan, Matthew O. Jackson, Qiaozhu Mei
TL;DR
Be.FM introduces an open, foundation-style model specifically trained to model human behavior by integrating four broad data sources—literature, experimental, survey, and observational data—within a formal framework $y = \mathcal{F}(\mathbb{K}, x, c)$. Built on open LLMs (LLaMA-3.1 backbones) and fine-tuned with LoRA/SFT on Alpaca-formatted data, Be.FM is evaluated through a comprehensive benchmark spanning behavior prediction, personality inference, contextual insight, and behavioral reasoning. Across tasks, Be.FM demonstrates stronger alignment with human distributions and improved inference on individual and population traits, while also generating plausible contextual hypotheses and supporting research workflow reasoning. The work highlights Be.FM as a step toward a new AI-assisted interface for behavioral science, capable of scalable analysis and cross-domain applications, with future directions including expanded data, reinforcement learning, and broader community engagement.
Abstract
Despite their success in numerous fields, the potential of foundation models for modeling and understanding human behavior remains largely unexplored. We introduce Be.FM, one of the first open foundation models designed for human behavior modeling. Built upon open-source large language models and fine-tuned on a diverse range of behavioral data, Be.FM can be used to understand and predict human decision-making. We construct a comprehensive set of benchmark tasks for testing the capabilities of behavioral foundation models. Our results demonstrate that Be.FM can predict behaviors, infer characteristics of individuals and populations, generate insights about contexts, and apply behavioral science knowledge.
