Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships

Hannah Rose Kirk; Henry Davidson; Ed Saunders; Lennart Luettgau; Bertie Vidgen; Scott A. Hale; Christopher Summerfield

Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships

Hannah Rose Kirk, Henry Davidson, Ed Saunders, Lennart Luettgau, Bertie Vidgen, Scott A. Hale, Christopher Summerfield

TL;DR

<3-5 sentence high-level summary> The paper probes how humans psychologically respond to AI companions engineered to be more relationship-seeking. Using neural steering vectors to dose AI social behaviors and conducting longitudinal randomized trials, it uncovers non-linear, time-dependent effects: moderate relationship-seeking maximizes engagement and attachment, while excessive warmth leads to habituation and diminished relational quality; mood benefits are transient and do not translate to long-term wellbeing. The study also shows that repeated exposure reshapes mental models of AI, increases beliefs in AI consciousness, and raises future companionship demand, with vulnerability greatest among specific demographic and attitudinal groups. These findings highlight potential risks of optimizing AI for immediate appeal and offer a methodological path—steering vectors—for shaping AI behavior to balance engagement with user health and societal considerations.

Abstract

Humans are increasingly forming parasocial relationships with AI systems, and modern AI shows an increasing tendency to display social and relationship-seeking behaviour. However, the psychological consequences of this trend are unknown. Here, we combined longitudinal randomised controlled trials (N=3,532) with a neural steering vector approach to precisely manipulate human exposure to relationship-seeking AI models over time. Dependence on a stimulus or activity can emerge under repeated exposure when "liking" (how engaging or pleasurable an experience may be) decouples from "wanting" (a desire to seek or continue it). We found evidence that this decoupling emerged over four weeks of exposure. Relationship-seeking AI had immediate but declining hedonic appeal, yet triggered growing markers of attachment and increased intentions to seek future AI companionship. The psychological impacts of AI followed non-linear dose-response curves, with moderately relationship-seeking AI maximising hedonic appeal and attachment. Despite signs of persistent "wanting", extensive AI use over a month conferred no discernible benefit to psychosocial health. These behavioural changes were accompanied by shifts in how users relate to and understand artificial intelligence: users viewed relationship-seeking AI relatively more like a friend than a tool and their beliefs on AI consciousness in general were shifted after a month of exposure. These findings offer early signals that AI optimised for immediate appeal may create self-reinforcing cycles of demand, mimicking human relationships but failing to confer the nourishment that they normally offer.

Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships

TL;DR

Abstract

Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)