Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

Renwei Meng

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

Renwei Meng

TL;DR

An Explainable Innovation Engine that upgrades the knowledge unit from text chunks to methods-as-nodes and maintains a weighted method provenance tree for traceable derivations and a hierarchical clustering abstraction tree for efficient top-down navigation is proposed.

Abstract

Retrieval-augmented generation (RAG) improves factual grounding, yet most systems rely on flat chunk retrieval and provide limited control over multi-step synthesis. We propose an Explainable Innovation Engine that upgrades the knowledge unit from text chunks to methods-as-nodes. The engine maintains a weighted method provenance tree for traceable derivations and a hierarchical clustering abstraction tree for efficient top-down navigation. At inference time, a strategy agent selects explicit synthesis operators (e.g., induction, deduction, analogy), composes new method nodes, and records an auditable trajectory. A verifier-scorer layer then prunes low-quality candidates and writes validated nodes back to support continual growth. Expert evaluation across six domains and multiple backbones shows consistent gains over a vanilla baseline, with the largest improvements on derivation-heavy settings, and ablations confirm the complementary roles of provenance backtracking and pruning. These results suggest a practical path toward controllable, explainable, and verifiable innovation in agentic RAG systems. Code is available at the project GitHub repository https://github.com/xiaolu-666113/Dual-Tree-Agent-RAG.

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

TL;DR

Abstract

Paper Structure (61 sections, 29 equations, 4 figures, 5 tables, 1 algorithm)

This paper contains 61 sections, 29 equations, 4 figures, 5 tables, 1 algorithm.

Introduction
Contributions.
Paper organization.
Related Work
Retrieval-Augmented Generation and Structured Indexing
Retrieval Strategy, Reflection, and Evaluation
Agentic Search and Verification Loops
Summary and Gap
Methods
Two-Tree Knowledge Representation
Method Provenance Tree
Clustering Abstraction Tree
Offline Construction: From Multimodal Sources to Two Trees
Multimodal Normalization
Context-Preserving Segmentation
...and 46 more sections

Figures (4)

Figure 1: The complete algorithm process
Figure 2: A schematic diagram of Clustering Abstraction Tree and Method Provenance Tree
Figure 3: Ablation results: score drop when removing each module. Larger values indicate higher sensitivity.
Figure 4: Cost--quality trade-off: expert score vs. token cost, colored by latency. The curve indicates the Pareto frontier.

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

TL;DR

Abstract

Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back

Authors

TL;DR

Abstract

Table of Contents

Figures (4)