ASA: Activation Steering for Tool-Calling Domain Adaptation

Youjin Wang; Run Zhou; Rong Fu; Shuaishuai Cao; Hongwei Zeng; Jiaxuan Lu; Sicheng Fan; Jiaqiao Zhao; Liangming Pan

ASA: Activation Steering for Tool-Calling Domain Adaptation

Youjin Wang, Run Zhou, Rong Fu, Shuaishuai Cao, Hongwei Zeng, Jiaxuan Lu, Sicheng Fan, Jiaqiao Zhao, Liangming Pan

TL;DR

ASA addresses the domain adaptation challenge for tool-calling in large language models under frequent API churn by introducing a training-free Activation Steering Adapter (ASA). ASA performs a single-shot, mid-layer activation perturbation using a lightweight domain router, per-domain probes, and a mixtures-of-vectors (MoV) mechanism, gated by a probe-driven sign gate controlled by a single knob $\alpha$. It demonstrates that tool intent is linearly decodable from mid-layer activations yet requires selective, context-aware control to cross strict parser boundaries, enabling robust, cross-domain tool invocation without weight updates. Across multiple model scales and domains, ASA achieves LoRA-comparable gains with substantially lower overhead and strong transferability, providing a practical, deployment-friendly approach to managing dynamic tool ecosystems.

Abstract

For real-world deployment of general-purpose LLM agents, the core challenge is often not tool use itself, but efficient domain adaptation under rapidly evolving toolsets, APIs, and protocols. Repeated LoRA or SFT across domains incurs exponentially growing training and maintenance costs, while prompt or schema methods are brittle under distribution shift and complex interfaces. We propose \textbf{Activation Steering Adapter (ASA}), a lightweight, inference-time, training-free mechanism that reads routing signals from intermediate activations and uses an ultra-light router to produce adaptive control strengths for precise domain alignment. Across multiple model scales and domains, ASA achieves LoRA-comparable adaptation with substantially lower overhead and strong cross-model transferability, making it ideally practical for robust, scalable, and efficient multi-domain tool ecosystems with frequent interface churn dynamics.

ASA: Activation Steering for Tool-Calling Domain Adaptation

TL;DR

. It demonstrates that tool intent is linearly decodable from mid-layer activations yet requires selective, context-aware control to cross strict parser boundaries, enabling robust, cross-domain tool invocation without weight updates. Across multiple model scales and domains, ASA achieves LoRA-comparable gains with substantially lower overhead and strong transferability, providing a practical, deployment-friendly approach to managing dynamic tool ecosystems.

Abstract

Paper Structure (39 sections, 13 equations, 9 figures, 20 tables, 1 algorithm)

This paper contains 39 sections, 13 equations, 9 figures, 20 tables, 1 algorithm.

Introduction
Related Work
LLM agents and tool-calling
Parameter-efficient fine-tuning and domain adaptation
Representation engineering and activation steering
Methodology
Problem definition
Notation and measurement primitives
Empirical diagnostics (motivation summary)
ASA architecture and operations
Steering vector construction
Router and probe definitions
Mixture-of-vectors (MoV) composition
Confidence gating and signed injection
Data hygiene and deployment constraints
...and 24 more sections

Figures (9)

Figure 1: Overview of tool-calling domain adaptation methods under rapidly evolving tool sets and interaction protocols, and our inference-time probe-gated activation control.
Figure 2: ASA overview. We extract the last-token hidden state at layer $L$ during the pre-fill pass, route the input to a domain $\hat{d}$, estimate tool intent probability $p$, construct $\mathrm{MoV}=v_{\hat{d}}+\beta v_{\mathrm{global}}$, and inject $\Delta h=\mathrm{Gate}(h)\cdot \alpha \cdot \mathrm{MoV}$ once. No further interventions are applied during incremental decoding.
Figure 3: Layer-wise probe sweep used to select the intervention depth.
Figure 4: Combined geometry and causal diagnostic at the selected layer.
Figure 5: Active steering performance on REST: domain-wise trends versus intervention strength $\alpha$.
...and 4 more figures

ASA: Activation Steering for Tool-Calling Domain Adaptation

TL;DR

Abstract

ASA: Activation Steering for Tool-Calling Domain Adaptation

Authors

TL;DR

Abstract

Table of Contents

Figures (9)