Foundation Model Engineering: Engineering Foundation Models Just as Engineering Software
Dezhi Ran, Mengzhou Wu, Wei Yang, Tao Xie
TL;DR
Foundation Models are expanding in capability but complexity risks a development crisis unless data and model management are systematized. The authors propose Foundation Model Engineering (FME), a framework of declarative, automated interfaces for data and model management, complemented by data-wrangling with weak supervision and a Git-like, collaborative workflow for models. Key contributions include a data-management architecture emphasizing labeling, cleaning, access control, and auditing, and a model-management architecture featuring distributed versioning, Fisher-information-guided updates, and modular mixture-of-experts with automated finetuning and deployment. The work outlines three research pillars—Declarative FM Engineering, Fine-grained Data Management, and Automating Model Management—along with practical implications for tooling, governance, and scalable, responsible FM development.
Abstract
By treating data and models as the source code, Foundation Models (FMs) become a new type of software. Mirroring the concept of software crisis, the increasing complexity of FMs making FM crisis a tangible concern in the coming decade, appealing for new theories and methodologies from the field of software engineering. In this paper, we outline our vision of introducing Foundation Model (FM) engineering, a strategic response to the anticipated FM crisis with principled engineering methodologies. FM engineering aims to mitigate potential issues in FM development and application through the introduction of declarative, automated, and unified programming interfaces for both data and model management, reducing the complexities involved in working with FMs by providing a more structured and intuitive process for developers. Through the establishment of FM engineering, we aim to provide a robust, automated, and extensible framework that addresses the imminent challenges, and discovering new research opportunities for the software engineering field.
