DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Aaron Shen; Alfred Shen

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Aaron Shen, Alfred Shen

Abstract

Large language model (LLM) agents have demonstrated remarkable capabilities in tool use, reasoning, and code generation, yet single-agent systems exhibit fundamental limitations when confronted with complex research tasks demanding multi-source synthesis, adversarial verification, and personalized delivery. We present DOVA (Deep Orchestrated Versatile Agent), a multi-agent platform introducing three key innovations: (1) deliberation-first orchestration, where explicit meta-reasoning precedes tool invocation, informed by a persistent user model and entity-aware conversation context; (2) hybrid collaborative reasoning, a composable three-phase pipeline unifying ensemble diversity, blackboard transparency, and iterative refinement; and (3) adaptive multi-tiered thinking, a six-level token-budget allocation scheme that reduces inference cost by 40-60% on simple tasks while preserving deep reasoning capacity. We formalize the core algorithms, present an architectural ablation study across seven system configurations, and analyze the contribution of each component to answer confidence, source coverage, and token efficiency.

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Abstract

Paper Structure (28 sections, 1 theorem, 9 equations, 2 figures, 6 tables, 6 algorithms)

This paper contains 28 sections, 1 theorem, 9 equations, 2 figures, 6 tables, 6 algorithms.

Introduction
Contributions
Preliminaries
Related Work
System Architecture
Agent Layer
Model Tiering
Core Algorithms
ReAct Reasoning with Self-Reflection
Deliberation-First Orchestration
Hybrid Collaborative Reasoning
Adaptive Multi-Tiered Thinking
Multi-Component Confidence Scoring
Diversity-Aware Memory Retrieval
Query Intent Classification
...and 13 more sections

Key Result

Proposition 5.1

Let $f_d$ be the fraction of queries where deliberation selects $\texttt{RESPOND\_DIRECTLY}$. The expected tool-call volume relative to a standard ReAct agent is $(1 - f_d)$, achieving cost savings proportional to $f_d \cdot \overline{c}_{\mathrm{tool}}$, where $\overline{c}_{\mathrm{tool}}$ is the

Figures (2)

Figure 1: Layered architecture of Dova. Queries enter through the Interface Layer, pass through Orchestration (with deliberation), dispatch to specialized agents, which leverage collaborative reasoning and intelligence services.
Figure 2: Token consumption: adaptive vs. fixed Medium. Adaptive saves 94% on classification and 75% on summarization.

Theorems & Definitions (4)

Definition 2.1: Agent
Definition 2.2: Reasoning Trace
Definition 2.3: Confidence Function
Proposition 5.1: Tool Call Reduction

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Abstract

DOVA: Deliberation-First Multi-Agent Orchestration for Autonomous Research Automation

Authors

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (4)