Task-Oriented Communication with Out-of-Distribution Detection: An Information Bottleneck Framework
Hongru Li, Wentao Yu, Hengtao He, Jiawei Shao, Shenghui Song, Jun Zhang, Khaled B. Letaief
TL;DR
This work addresses the open-world challenge of out-of-distribution (OoD) data in task-oriented communication for edge inference. It extends the information bottleneck (IB) framework to a class-conditional formulation (CCIB), incorporating class-conditioned latent priors and a contrastive (triplet) separation loss to make in-distribution latent representations distinct from OoD ones, while preserving the rate-distortion tradeoff via a variational upper bound $\mathcal{L}_{VCCIB}$. The approach uses Gaussian latent models with epoch-adaptive priors and end-to-end training over a joint source-channel coding pipeline, enabling reliable OoD detection without requiring OoD retraining. Experimental results on CIFAR-10 with LSUN and Tiny ImageNet OoD data show improved OoD detection (AUROC improvements) and robust performance under both static and dynamic channel conditions, demonstrating practical value for edge-enabled, low-latency inference in open-world networks.
Abstract
Task-oriented communication is an emerging paradigm for next-generation communication networks, which extracts and transmits task-relevant information, instead of raw data, for downstream applications. Most existing deep learning (DL)-based task-oriented communication systems adopt a closed-world scenario, assuming either the same data distribution for training and testing, or the system could have access to a large out-of-distribution (OoD) dataset for retraining. However, in practical open-world scenarios, task-oriented communication systems need to handle unknown OoD data. Under such circumstances, the powerful approximation ability of learning methods may force the task-oriented communication systems to overfit the training data (i.e., in-distribution data) and provide overconfident judgments when encountering OoD data. Based on the information bottleneck (IB) framework, we propose a class conditional IB (CCIB) approach to address this problem in this paper, supported by information-theoretical insights. The idea is to extract distinguishable features from in-distribution data while keeping their compactness and informativeness. This is achieved by imposing the class conditional latent prior distribution and enforcing the latent of different classes to be far away from each other. Simulation results shall demonstrate that the proposed approach detects OoD data more efficiently than the baselines and state-of-the-art approaches, without compromising the rate-distortion tradeoff.
