Table of Contents
Fetching ...

A Reference Architecture for Designing Foundation Model based Systems

Qinghua Lu, Liming Zhu, Xiwei Xu, Zhenchang Xing, Jon Whittle

TL;DR

This paper tackles the lack of systematic architectural guidance for foundation-model (FM) based systems in the context of responsible AI and evolving interface boundaries. It proposes a pattern-oriented reference architecture with three layers (system, operation, supply chain) and seven key design decisions to balance FM capabilities with governance, safety, and maintainability. The architecture is empirically evaluated by mapping to a responsible AI chatbot, incorporating retrieval augmented generation, verifiers, think-aloud explanations, multi-agent orchestration, and an AIBOM registry. The work offers concrete, actionable guidance for building adaptable, responsible FM-based systems and points toward a pattern catalogue to expand design coverage over time.

Abstract

The release of ChatGPT, Gemini, and other large language model has drawn huge interests on foundations models. There is a broad consensus that foundations models will be the fundamental building blocks for future AI systems. However, there is a lack of systematic guidance on the architecture design. Particularly, the the rapidly growing capabilities of foundations models can eventually absorb other components of AI systems, posing challenges of moving boundary and interface evolution in architecture design. Furthermore, incorporating foundations models into AI systems raises significant concerns about responsible and safe AI due to their opaque nature and rapidly advancing intelligence. To address these challenges, the paper first presents an architecture evolution of AI systems in the era of foundation models, transitioning from "foundation-model-as-a-connector" to "foundation-model-as-a-monolithic architecture". The paper then identifies key design decisions and proposes a pattern-oriented reference architecture for designing responsible foundation-model-based systems. The patterns can enable the potential of foundation models while ensuring associated risks.

A Reference Architecture for Designing Foundation Model based Systems

TL;DR

This paper tackles the lack of systematic architectural guidance for foundation-model (FM) based systems in the context of responsible AI and evolving interface boundaries. It proposes a pattern-oriented reference architecture with three layers (system, operation, supply chain) and seven key design decisions to balance FM capabilities with governance, safety, and maintainability. The architecture is empirically evaluated by mapping to a responsible AI chatbot, incorporating retrieval augmented generation, verifiers, think-aloud explanations, multi-agent orchestration, and an AIBOM registry. The work offers concrete, actionable guidance for building adaptable, responsible FM-based systems and points toward a pattern catalogue to expand design coverage over time.

Abstract

The release of ChatGPT, Gemini, and other large language model has drawn huge interests on foundations models. There is a broad consensus that foundations models will be the fundamental building blocks for future AI systems. However, there is a lack of systematic guidance on the architecture design. Particularly, the the rapidly growing capabilities of foundations models can eventually absorb other components of AI systems, posing challenges of moving boundary and interface evolution in architecture design. Furthermore, incorporating foundations models into AI systems raises significant concerns about responsible and safe AI due to their opaque nature and rapidly advancing intelligence. To address these challenges, the paper first presents an architecture evolution of AI systems in the era of foundation models, transitioning from "foundation-model-as-a-connector" to "foundation-model-as-a-monolithic architecture". The paper then identifies key design decisions and proposes a pattern-oriented reference architecture for designing responsible foundation-model-based systems. The patterns can enable the potential of foundation models while ensuring associated risks.
Paper Structure (16 sections, 2 figures)