Information Templates: A New Paradigm for Intelligent Active Feature Acquisition
Hung-Tien Huang, Dzung Dinh, Junier B. Oliva
TL;DR
The paper tackles the challenge of costly, instance-adaptive feature acquisition by introducing TAFA, a non-RL, template-based framework that learns a library of informative feature subsets (templates) to guide next-feature decisions. By formulating template discovery as a submodular set-optimization problem and combining mutation-guided greedy search with continuous relaxation, TAFA efficiently narrows the action space while maintaining strong cost-benefit performance. The approach yields an interpretable policy via distillation into step-wise decision trees that expose explicit acquisition rules. Extensive experiments on synthetic and real-world datasets show TAFA outperforms RL-free baselines in accuracy and acquisition efficiency, while delivering substantial speedups and clear interpretability, making it practical for real-time AFA scenarios.
Abstract
Active feature acquisition (AFA) is an instance-adaptive paradigm in which, at inference time, a policy sequentially chooses which features to acquire (at a cost) before predicting. Existing approaches either train reinforcement learning policies, which deal with a difficult MDP, or greedy policies that cannot account for the joint informativeness of features or require knowledge about the underlying data distribution. To overcome this, we propose Template-based AFA (TAFA), a non-greedy framework that learns a small library of feature templates -- sets of features that are jointly informative -- and uses this library of templates to guide the next feature acquisitions. Through identifying feature templates, the proposed framework not only significantly reduces the action space considered by the policy but also alleviates the need to estimate the underlying data distribution. Extensive experiments on synthetic and real-world datasets show that TAFA outperforms the existing state-of-the-art baselines while achieving lower overall acquisition cost and computation.
