Table of Contents
Fetching ...

Jaco: An Offline Running Privacy-aware Voice Assistant

Daniel Bermuth, Alexander Poeppel, Wolfgang Reif

TL;DR

The paper addresses privacy concerns in voice assistants by proposing Jaco, an offline, multilingual assistant that can run on low-resource devices. It presents a modular, containerized architecture with a two-tier design (Satellites and Master) and an offline pipeline (wake word, STT, NLU, TTS) coordinated via MQTT, enabling safe on-device training and operation. Skills are easily added and isolated within containers, governed by a per-skill permission system and a store for sharing, inspection, and offline deployment. Benchmark results demonstrate competitive offline performance on standard SLU tasks, highlighting strengths in medium/low-noise conditions and in hardware-enabled scenarios, with noted challenges in pronunciation of artist names across languages. Overall, Jaco offers a privacy-preserving, extensible, and practical offline alternative to cloud-based voice assistants for real-world use on consumer hardware.

Abstract

With the recent advance in speech technology, smart voice assistants have been improved and are now used by many people. But often these assistants are running online as a cloud service and are not always known for a good protection of users' privacy. This paper presents the architecture of a novel voice assistant, called Jaco, with the following features: (a) It can run completely offline, even on low resource devices like a RaspberryPi. (b) Through a skill concept it can be easily extended. (c) The architectural focus is on protecting users' privacy, but without restricting capabilities for developers. (d) It supports multiple languages. (e) It is competitive with other voice assistant solutions. In this respect the assistant combines and extends the advantages of other approaches.

Jaco: An Offline Running Privacy-aware Voice Assistant

TL;DR

The paper addresses privacy concerns in voice assistants by proposing Jaco, an offline, multilingual assistant that can run on low-resource devices. It presents a modular, containerized architecture with a two-tier design (Satellites and Master) and an offline pipeline (wake word, STT, NLU, TTS) coordinated via MQTT, enabling safe on-device training and operation. Skills are easily added and isolated within containers, governed by a per-skill permission system and a store for sharing, inspection, and offline deployment. Benchmark results demonstrate competitive offline performance on standard SLU tasks, highlighting strengths in medium/low-noise conditions and in hardware-enabled scenarios, with noted challenges in pronunciation of artist names across languages. Overall, Jaco offers a privacy-preserving, extensible, and practical offline alternative to cloud-based voice assistants for real-world use on consumer hardware.

Abstract

With the recent advance in speech technology, smart voice assistants have been improved and are now used by many people. But often these assistants are running online as a cloud service and are not always known for a good protection of users' privacy. This paper presents the architecture of a novel voice assistant, called Jaco, with the following features: (a) It can run completely offline, even on low resource devices like a RaspberryPi. (b) Through a skill concept it can be easily extended. (c) The architectural focus is on protecting users' privacy, but without restricting capabilities for developers. (d) It supports multiple languages. (e) It is competitive with other voice assistant solutions. In this respect the assistant combines and extends the advantages of other approaches.
Paper Structure (6 sections, 4 figures, 2 tables)

This paper contains 6 sections, 4 figures, 2 tables.

Figures (4)

  • Figure 1: General architecture of a voice assistant.
  • Figure 2: Process of a voice interaction using Jaco.
  • Figure 3: Store with permission notifications of the selected skill.
  • Figure 4: Benchmark coffee orders with noisy backgrounds. The results of DialogFlow, Watson, Luis and Rhino have been taken from RHIBEN.