Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

Keane Ong; Wei Dai; Carol Li; Dewei Feng; Hengzhi Li; Jingyao Wu; Jiaee Cheong; Rui Mao; Gianmarco Mengaldo; Erik Cambria; Paul Pu Liang

Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

Keane Ong, Wei Dai, Carol Li, Dewei Feng, Hengzhi Li, Jingyao Wu, Jiaee Cheong, Rui Mao, Gianmarco Mengaldo, Erik Cambria, Paul Pu Liang

TL;DR

The paper addresses fragmentation in psychological and social behavior understanding by creating a unified, large-scale multimodal benchmark. It introduces Human Behavior Atlas, which standardizes taxonomy, data formats, evaluation, and augments data with behavioral descriptors to enable unified learning across tasks. Training three vanilla OmniSapiens-7B variants (SFT, BAM, RL) on the atlas yields superior performance relative to existing multimodal LLMs in diverse tasks, with RL particularly strong on open-ended generation; pretraining also improves transfer to held-out and novel tasks. The work demonstrates that structured benchmarks plus descriptor-based adaptations can meaningfully advance unified models for complex human behaviors and provides methodological guidance for future atlas construction.

Abstract

Using intelligent systems to perceive psychological and social behaviors, that is, the underlying affective, cognitive, and pathological states that are manifested through observable behaviors and social interactions, remains a challenge due to their complex, multifaceted, and personalized nature. Existing work tackling these dimensions through specialized datasets and single-task systems often miss opportunities for scalability, cross-task transfer, and broader generalization. To address this gap, we curate Human Behavior Atlas, a unified benchmark of diverse behavioral tasks designed to support the development of unified models for understanding psychological and social behaviors. Human Behavior Atlas comprises over 100,000 samples spanning text, audio, and visual modalities, covering tasks on affective states, cognitive states, pathologies, and social processes. Our unification efforts can reduce redundancy and cost, enable training to scale efficiently across tasks, and enhance generalization of behavioral features across domains. On Human Behavior Atlas, we train three models: OmniSapiens-7B SFT, OmniSapiens-7B BAM, and OmniSapiens-7B RL. We show that training on Human Behavior Atlas enables models to consistently outperform existing multimodal LLMs across diverse behavioral tasks. Pretraining on Human Behavior Atlas also improves transfer to novel behavioral datasets; with the targeted use of behavioral descriptors yielding meaningful performance gains.

Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

TL;DR

Abstract

Human Behavior Atlas: Benchmarking Unified Psychological and Social Behavior Understanding

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)