Help the machine to help you: an evaluation in the wild of egocentric data cleaning via skeptical learning
Andrea Bontempelli, Matteo Busso, Leonardo Javier Malcotti, Fausto Giunchiglia
TL;DR
This work evaluates Skeptical Learning (SKEL) in a real-world, longitudinal setting to clean egocentric contextual labels, focusing on location in university students using the iLog app. SKEL employs Gaussian Processes (GPs) to quantify uncertainty and guide user queries, with a two-phase, in-the-wild deployment that consolidates skeptical questions to reduce interruptions. Results show limited average gains over a non-skeptical baseline due to participant consistency and data gaps, though some individuals and longer studies may benefit from reduced annotation burden and improved data quality. The study demonstrates the feasibility of SKEL in mobile, in-the-wild data collection while highlighting design considerations such as scheduling, per-user hyperparameters, and the need for longer investigations and multi-modal inputs.
Abstract
Any digital personal assistant, whether used to support task performance, answer questions, or manage work and daily life, including fitness schedules, requires high-quality annotations to function properly. However, user annotations, whether actively produced or inferred from context (e.g., data from smartphone sensors), are often subject to errors and noise. Previous research on Skeptical Learning (SKEL) addressed the issue of noisy labels by comparing offline active annotations with passive data, allowing for an evaluation of annotation accuracy. However, this evaluation did not include confirmation from end-users, the best judges of their own context. In this study, we evaluate SKEL's performance in real-world conditions with actual users who can refine the input labels based on their current perspectives and needs. The study involves university students using the iLog mobile application on their devices over a period of four weeks. The results highlight the challenges of finding the right balance between user effort and data quality, as well as the potential benefits of using SKEL, which include reduced annotation effort and improved quality of collected data.
