InstructPipe: Generating Visual Blocks Pipelines with Human Instructions and LLMs
Zhongyi Zhou, Jing Jin, Vrushank Phadnis, Xiuxiu Yuan, Jun Jiang, Xun Qian, Kristen Wright, Mark Sherwood, Jason Mayes, Jingtao Zhou, Yiyi Huang, Zheng Xu, Yinda Zhang, Johnny Lee, Alex Olwal, David Kim, Ram Iyengar, Na Li, Ruofei Du
TL;DR
InstructPipe presents a novel AI-assisted visual programming workflow for ML pipelines that uses two specialized LLM modules to generate a pipeline specification via pseudocode and a code interpreter to render the pipeline in a node-graph editor. By offloading node selection and connection to language-driven components, the system significantly reduces interactive effort and accelerates prototyping, demonstrated through technical and user evaluations with novice users. The study highlights substantial workload reductions and improved onboarding, while also revealing cognitive-load and prompting challenges that motivate future work on scalable node libraries, dynamic interactions, and responsible AI practices. Overall, InstructPipe advances human–AI collaboration for rapid, open-ended ML prototyping in visual programming environments, with practical implications for education and rapid concept exploration.
Abstract
Visual programming has the potential of providing novice programmers with a low-code experience to build customized processing pipelines. Existing systems typically require users to build pipelines from scratch, implying that novice users are expected to set up and link appropriate nodes from a blank workspace. In this paper, we introduce InstructPipe, an AI assistant for prototyping machine learning (ML) pipelines with text instructions. We contribute two large language model (LLM) modules and a code interpreter as part of our framework. The LLM modules generate pseudocode for a target pipeline, and the interpreter renders the pipeline in the node-graph editor for further human-AI collaboration. Both technical and user evaluation (N=16) shows that InstructPipe empowers users to streamline their ML pipeline workflow, reduce their learning curve, and leverage open-ended commands to spark innovative ideas.
