Table of Contents
Fetching ...

Berta: an open-source, modular tool for AI-enabled clinical documentation

Samridhi Vaid, Mike Weldon, Jesse Dunn, Sacha Davis, Kevin Lonergan, Henry Li, Jeffrey Franc, Mohamed Abdalla, Daniel C. Baumgart, Jake Hayward, J Ross Mitchell

Abstract

Commercial AI scribes cost \$99-600 per physician per month, operate as opaque systems, and do not return data to institutional infrastructure, limiting organizational control over data governance, quality improvement, and clinical workflows. We developed Berta, an open-source modular scribe platform for AI-enabled clinical documentation, and deployed a customized implementation within Alberta Health Services (AHS) integrated with their existing Snowflake AI Data Cloud infrastructure. The system combines automatic speech recognition with large language models while retaining all clinical data within the secure AHS environment. During eight months (November 2024 to July 2025), 198 emergency physicians used the system in 105 urban and rural facilities, generating 22148 clinical sessions and more than 2800 hours of audio. The use grew from 680 to 5530 monthly sessions. Operating costs averaged less than \$30 per physician per month, a 70-95% reduction compared to commercial alternatives. AHS has since approved expansion to 850 physicians. This is the first provincial-scale deployment of an AI scribe integrated with existing health system infrastructure. By releasing Berta as open source, we provide a reproducible, cost-effective alternative that health systems can adapt to their own secure environments, supporting data sovereignty and informed evaluation of AI documentation technology.

Berta: an open-source, modular tool for AI-enabled clinical documentation

Abstract

Commercial AI scribes cost \30 per physician per month, a 70-95% reduction compared to commercial alternatives. AHS has since approved expansion to 850 physicians. This is the first provincial-scale deployment of an AI scribe integrated with existing health system infrastructure. By releasing Berta as open source, we provide a reproducible, cost-effective alternative that health systems can adapt to their own secure environments, supporting data sovereignty and informed evaluation of AI documentation technology.
Paper Structure (18 sections, 2 figures)

This paper contains 18 sections, 2 figures.

Figures (2)

  • Figure 1: The Berta user interface. The left sidebar shows a chronological list of sessions, while the main area displays the audio waveform and the automatically generated medical note below. All data shown are from simulated patient sessions to protect privacy; no real patient information is used.
  • Figure 2: Berta System Architecture. The system features a multi-layer design with a Next.js front-end and a Python FastAPI backend coupled with an LLM inference engine and a speech-to-text module. Multiple inference engines are supported including: vLLM, Ollama, LM Studio, and any OpenAI compatible API. Multiple speech-to-text modules are also supported including: WhisperX, Amazon Transcribe, Nvidia Parakeet V2, and others. Berta supports on-premises or cloud-based GPU acceleration and can be deployed securely within a virtual private cloud, with support for multiple user authentication systems. Icons courtesy of Wikicommons.