Distillation-Accelerated Uncertainty Modeling for Multi-Objective RTA Interception
Gaoxiang Zhao, Ruina Qiu, Pengpeng Zhao, Rongjin Wang, Zhangang Lin, Xiaoqiang Wang
TL;DR
This paper addresses efficient and reliable traffic interception in Real-Time Auctions by jointly modeling multiple downstream objectives and prediction uncertainty. The authors introduce DAUM, a two-stage framework that combines multi-objective learning with uncertainty estimation, and they further apply knowledge distillation to produce a lightweight model that preserves uncertainty signals. Their experiments on the JD advertisement dataset show DAUM improves predictive performance across metrics while enabling a roughly tenfold increase in inference speed with the distilled model. The approach also demonstrates effective uncertainty transfer across correlated metrics, particularly useful under imbalanced label conditions, making uncertainty-aware interception practical for real-time deployment.
Abstract
Real-Time Auction (RTA) Interception aims to filter out invalid or irrelevant traffic to enhance the integrity and reliability of downstream data. However, two key challenges remain: (i) the need for accurate estimation of traffic quality together with sufficiently high confidence in the model's predictions, typically addressed through uncertainty modeling, and (ii) the efficiency bottlenecks that such uncertainty modeling introduces in real-time applications due to repeated inference. To address these challenges, we propose DAUM, a joint modeling framework that integrates multi-objective learning with uncertainty modeling, yielding both traffic quality predictions and reliable confidence estimates. Building on DAUM, we further apply knowledge distillation to reduce the computational overhead of uncertainty modeling, while largely preserving predictive accuracy and retaining the benefits of uncertainty estimation. Experiments on the JD advertisement dataset demonstrate that DAUM consistently improves predictive performance, with the distilled model delivering a tenfold increase in inference speed.
