MEGAnno+: A Human-LLM Collaborative Annotation System
Hannah Kim, Kushan Mitra, Rafael Li Chen, Sajjadur Rahman, Dan Zhang
TL;DR
The paper addresses the challenge of obtaining high quality labeled data at scale by combining the efficiency of large language models with human verification. It introduces MEGAnno+, a human-LLM collaborative annotation system that integrates a reusable agent and prompt management framework, an LLM annotation pipeline, and an in-notebook verification widget to selectively validate labels. The main contributions include a novel data model with Agents, Jobs, and Verification, robust end-to-end LLM annotation with error handling and metadata capture, and an exploratory verification workflow that enables targeted human review. The approach aims to deliver reliable labeled data for domain-specific or privacy-sensitive tasks, while enabling reuse of configurations and facilitating comparisons across LLMs and prompts. The work highlights practical implications for deploying LLM annotators and discusses design choices, limitations, and avenues for future expansion.
Abstract
Large language models (LLMs) can label data faster and cheaper than humans for various NLP tasks. Despite their prowess, LLMs may fall short in understanding of complex, sociocultural, or domain-specific context, potentially leading to incorrect annotations. Therefore, we advocate a collaborative approach where humans and LLMs work together to produce reliable and high-quality labels. We present MEGAnno+, a human-LLM collaborative annotation system that offers effective LLM agent and annotation management, convenient and robust LLM annotation, and exploratory verification of LLM labels by humans.
