Towards Adaptive Mechanism Activation in Language Agent
Ziyang Huang, Jun Zhao, Kang Liu
TL;DR
ALAMA tackles rigidity in Language Agents by enabling adaptive activation of multiple mechanisms through UniAct, which unifies workflows by explicit Actions. It introduces self-exploration for data-efficient training and two optimization modules, Implicit Mechanism Activation Optimization ($L_{IMAO}$) and Mechanism Activation Adaptability Optimization ($L_{MAAO}$), based on a $KTO$-based objective. Empirical results on mathematical and knowledge-intensive reasoning show ALAMA surpassing fixed-mechanism baselines and strong fine-tuning approaches, with solid generalization to held-out tasks and robustness over majority voting. The work demonstrates a practical, data-efficient path toward more dynamic, context-sensitive Language Agents that can select appropriate solution strategies across varied tasks.
Abstract
Language Agent could be endowed with different mechanisms for autonomous task accomplishment. Current agents typically rely on fixed mechanisms or a set of mechanisms activated in a predefined order, limiting their adaptation to varied potential task solution structures. To this end, this paper proposes \textbf{A}daptive \textbf{L}anguage \textbf{A}gent \textbf{M}echanism \textbf{A}ctivation Learning with Self-Exploration (\textbf{ALAMA}), which focuses on optimizing mechanism activation adaptability without reliance on expert models. Initially, it builds a harmonized agent framework (\textbf{UniAct}) to \textbf{Uni}fy different mechanisms via \textbf{Act}ions. Then it leverages a training-efficient optimization method based on self-exploration to enable the UniAct to adaptively activate the appropriate mechanisms according to the potential characteristics of the task. Experimental results demonstrate significant improvements in downstream agent tasks, affirming the effectiveness of our approach in facilitating more dynamic and context-sensitive mechanism activation.
