AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction
Ruijie Shi, Houbin Zhang, Yuecheng Han, Yuheng Wang, Jingru Fan, Runde Yang, Yufan Dang, Huatao Li, Dewen Liu, Yuan Cheng, Chen Qian
TL;DR
This work tackles the opacity of agentic systems by addressing interpretability via Agentic Workflow Reconstruction (AWR), which aims to synthesize an explicit white-box surrogate workflow from input–output data. It introduces AgentXRay, an MCTS-based framework that searches a unified primitive space of agent/tool primitives and applies a dynamic Red-Black Pruning strategy to manage the combinatorial explosion. Across five diverse domains, AgentXRay achieves higher proxy fidelity (Static Functional Equivalence) than behavior cloning and unpruned baselines, while significantly improving search efficiency and enabling deeper exploration under fixed budgets. The results demonstrate that editable, interpretable workflows can approximate complex black-box agentic systems, offering a practical path toward transparency, debugging, and reuse, with open questions about richer workflow graphs and evaluator design for broader domains.
Abstract
Large Language Models have shown strong capabilities in complex problem solving, yet many agentic systems remain difficult to interpret and control due to opaque internal workflows. While some frameworks offer explicit architectures for collaboration, many deployed agentic systems operate as black boxes to users. We address this by introducing Agentic Workflow Reconstruction (AWR), a new task aiming to synthesize an explicit, interpretable stand-in workflow that approximates a black-box system using only input--output access. We propose AgentXRay, a search-based framework that formulates AWR as a combinatorial optimization problem over discrete agent roles and tool invocations in a chain-structured workflow space. Unlike model distillation, AgentXRay produces editable white-box workflows that match target outputs under an observable, output-based proxy metric, without accessing model parameters. To navigate the vast search space, AgentXRay employs Monte Carlo Tree Search enhanced by a scoring-based Red-Black Pruning mechanism, which dynamically integrates proxy quality with search depth. Experiments across diverse domains demonstrate that AgentXRay achieves higher proxy similarity and reduces token consumption compared to unpruned search, enabling deeper workflow exploration under fixed iteration budgets.
