Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities

Hanjing Shi; Dominic DiFranzo

Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities

Hanjing Shi, Dominic DiFranzo

TL;DR

The paper investigates how early-stage oversight expectations for agentic AI crystallize in public online communities. It compares two role-differentiated ecosystems, r/OpenClaw and r/Moltbook, using a joint topic-modeling framework, an oversight-theme abstraction, and engagement-weighted salience to reveal how the concept of 'human control' functions as an anchor but accrues distinct meanings across contexts. The results show that while both communities discuss human control, OpenClaw emphasizes execution guardrails and resource constraints, whereas Moltbook centers on legitimacy, identity, and social accountability, with strong overall separability ($JSD=0.418$, cosine $=0.372$, $p=0.0005$). This highlights the need for role-sensitive oversight design and governance tools that align with ecosystem-specific expectations, and it provides a practical, scalable method for analyzing early discourse to guide auditing, governance, and design before formal standards emerge.

Abstract

Oversight for agentic AI is often discussed as a single goal ("human control"), yet early adoption may produce role-specific expectations. We present a comparative analysis of two newly active Reddit communities in Jan--Feb 2026 that reflect different socio-technical roles: r/OpenClaw (deployment and operations) and r/Moltbook (agent-centered social interaction). We conceptualize this period as an early-stage crystallization phase, where oversight expectations form before norms reach equilibrium. Using topic modeling in a shared comparison space, a coarse-grained oversight-theme abstraction, engagement-weighted salience, and divergence tests, we show the communities are strongly separable (JSD =0.418, cosine =0.372, permutation $p=0.0005$). Across both communities, "human control" is an anchor term, but its operational meaning diverges: r/OpenClaw} emphasizes execution guardrails and recovery (action-risk), while r/Moltbook} emphasizes identity, legitimacy, and accountability in public interaction (meaning-risk). The resulting distinction offers a portable lens for designing and evaluating oversight mechanisms that match agent role, rather than applying one-size-fits-all control policies.

Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities

TL;DR

, cosine

). This highlights the need for role-sensitive oversight design and governance tools that align with ecosystem-specific expectations, and it provides a practical, scalable method for analyzing early discourse to guide auditing, governance, and design before formal standards emerge.

Abstract

). Across both communities, "human control" is an anchor term, but its operational meaning diverges: r/OpenClaw} emphasizes execution guardrails and recovery (action-risk), while r/Moltbook} emphasizes identity, legitimacy, and accountability in public interaction (meaning-risk). The resulting distinction offers a portable lens for designing and evaluating oversight mechanisms that match agent role, rather than applying one-size-fits-all control policies.

Paper Structure (23 sections, 1 equation, 1 figure, 5 tables)

This paper contains 23 sections, 1 equation, 1 figure, 5 tables.

Introduction
Related Work
Human--AI Teaming and Oversight
Early Agent Communities and In-the-Wild Discourse
Data and Methods
Data Scope and Unit of Analysis
Topic Modeling
Oversight Theme Mapping
Divergence and Salience Metrics
Results
Data Overview
Topic Structure (Separate and Combined)
Oversight Theme Distribution (Combined Reference Space)
Subreddit Separability and Divergence
Salience Analysis (Descriptive)
...and 8 more sections

Figures (1)

Figure 1: Post-level salience across oversight themes. For each subreddit $s$ and theme $t$, salience is $\text{Prevalence}_{s,t}\times\text{MeanEngagement}_{s,t}$ (equivalently, the sum of post scores for posts mapped to $(s,t)$). Themes shown are the union of topic-to-theme assignments from the two separate $k{=}3$ subreddit LDA models (four themes appear because multiple topics map to the same theme). This heatmap is descriptive; divergence statistics are computed only in the combined-corpus LDA reference space.

Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities

TL;DR

Abstract

Human Control Is the Anchor, Not the Answer: Early Divergence of Oversight in Agentic AI Communities

Authors

TL;DR

Abstract

Table of Contents

Figures (1)