RACH-Space: Reconstructing Adaptive Convex Hull Space with Applications in Weak Supervision
Woojoo Na, Abiy Tasissa
TL;DR
RACH-Space tackles weakly supervised learning where only incomplete, noisy weak signals are available. It introduces a geometry-based framework that treats weak signals as columns of a matrix and leverages nested convex hulls to define a safe region guiding label estimation, parameterized by a single $\epsilon$ that bounds per-signal error rates. The method formulates a quadratic program to recover a probabilistic label vector under a unit-sum constraint, and uses convex-hull structure to regularize the solution, with efficiency enhanced via signal grouping. Empirically, RACH-Space achieves state-of-the-art or on-par performance on 14 real-world WRENCH datasets, demonstrating robustness without strong distributional assumptions.
Abstract
We introduce RACH-Space, an algorithm for labelling unlabelled data in weakly supervised learning, given incomplete, noisy information about the labels. RACH-Space offers simplicity in implementation without requiring hard assumptions on data or the sources of weak supervision, and is well suited for practical applications where fully labelled data is not available. Our method is built upon a geometrical interpretation of the space spanned by the set of weak signals. We also analyze the theoretical properties underlying the relationship between the convex hulls in this space and the accuracy of our output labels, bridging geometry with machine learning. Empirical results demonstrate that RACH-Space works well in practice and compares favorably to the best existing label models for weakly supervised learning.
