LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

Ioannis Prokopiou; Ioannis Sina; Agisilaos Kounelis; Pantelis Vikatos; Themos Stafylakis

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

Ioannis Prokopiou, Ioannis Sina, Agisilaos Kounelis, Pantelis Vikatos, Themos Stafylakis

TL;DR

LabelBuddy is introduced, an open-source collaborative auto-tagging audio annotation tool designed to bridge the gap between human intent and machine understanding, which supports multi-user consensus, containerized model isolation, and a roadmap for extending agents and LALMs.

Abstract

The advancement of Machine learning (ML), Large Audio Language Models (LALMs), and autonomous AI agents in Music Information Retrieval (MIR) necessitates a shift from static tagging to rich, human-aligned representation learning. However, the scarcity of open-source infrastructure capable of capturing the subjective nuances of audio annotation remains a critical bottleneck. This paper introduces \textbf{LabelBuddy}, an open-source collaborative auto-tagging audio annotation tool designed to bridge the gap between human intent and machine understanding. Unlike static tools, it decouples the interface from inference via containerized backends, allowing users to plug in custom models for AI-assisted pre-annotation. We describe the system architecture, which supports multi-user consensus, containerized model isolation, and a roadmap for extending agents and LALMs. Code available at https://github.com/GiannisProkopiou/gsoc2022-Label-buddy.

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

TL;DR

Abstract

Paper Structure (19 sections, 2 figures, 1 table)

This paper contains 19 sections, 2 figures, 1 table.

Introduction
Related Work
Annotation Platforms & Domain Specificity.
LALMs & HITL Workflows.
Subjective Evaluation & RLHF.
System Architecture
Backend & Data Model
Containerized Inference Engine
Workflow & Interface
Project Setup & Task Management
The Annotation Loop
Case Study: NLP Music Tagging
Model Integration
The "Human-Verify" Loop.
Correction & Consensus.
...and 4 more sections

Figures (2)

Figure 1: System Architecture Overview: The architecture decouples the Django web server from Dockerized ML inference.
Figure 2: The annotation interface displaying AI-generated predictions as editable waveform regions.

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

TL;DR

Abstract

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

Authors

TL;DR

Abstract

Table of Contents

Figures (2)