ToolFactory: Automating Tool Generation by Leveraging LLM to Understand REST API Documentations
Xinyi Ni, Qiuyang Wang, Yukun Zhang, Pengyu Hong
TL;DR
ToolFactory provides an end-to-end, open-source pipeline to automatically generate AI-usable tools from diverse REST API documents, addressing the lack of standardization in scientific APIs. It combines APILLAMA for structured information extraction, a JSON-to-tool generation process, rigorous tool validation, and a knowledge-base–driven parameter value inference mechanism. The API Extraction Benchmark and the glycomaterials case study demonstrate that APILLAMA achieves strong structural accuracy while enabling practical tool creation, cross-database integration, and domain-agnostic applicability. This work lowers the development and learning costs for integrating scientific REST APIs into AI workflows, enabling domain-specific agents to operate with reduced manual engineering effort.
Abstract
LLM-based tool agents offer natural language interfaces, enabling users to seamlessly interact with computing services. While REST APIs are valuable resources for building such agents, they must first be transformed into AI-compatible tools. Automatically generating AI-compatible tools from REST API documents can greatly streamline tool agent development and minimize user learning curves. However, API documentation often suffers from a lack of standardization, inconsistent schemas, and incomplete information. To address these issues, we developed \textbf{ToolFactory}, an open-source pipeline for automating tool generation from unstructured API documents. To enhance the reliability of the developed tools, we implemented an evaluation method to diagnose errors. Furthermore, we built a knowledge base of verified tools, which we leveraged to infer missing information from poorly documented APIs. We developed the API Extraction Benchmark, comprising 167 API documents and 744 endpoints in various formats, and designed a JSON schema to annotate them. This annotated dataset was utilized to train and validate ToolFactory. The experimental results highlight the effectiveness of ToolFactory. We also demonstrated ToolFactory by creating a domain-specific AI agent for glycomaterials research. ToolFactory exhibits significant potential for facilitating the seamless integration of scientific REST APIs into AI workflows.
