Dynamic Feature Selection based on Rule-based Learning for Explainable Classification with Uncertainty Quantification
Javier Fumanal-Idocin, Raquel Fernandez-Peralta, Javier Andreu-Perez
TL;DR
This work addresses the need for explainable dynamic feature selection by formalizing uncertainty sources specific to sequential feature acquisition and adapting a rule-based global classifier for DFS. It introduces a model-agnostic DFS policy that greedily selects features to minimize divergence from a global rule-based predictor while accounting for epistemic costs, and uses an ensemble-/fuzzy-rule-based approach to quantify uncertainty during rule firings. Empirical results on five tabular datasets show competitive accuracy against state-of-the-art greedy and RL-based DFS methods, with the added benefit of transparent, interpretable predictions. The findings also reveal that predictive entropy decreases with more features largely due to information accumulation rather than the policy's effectiveness, and calibration behavior can diverge from confidence gains, highlighting important considerations for risk-sensitive applications like clinical decision support.
Abstract
Dynamic feature selection (DFS) offers a compelling alternative to traditional, static feature selection by adapting the selected features to each individual sample. This provides insights into the decision-making process for each case, which makes DFS especially significant in settings where decision transparency is key, i.e., clinical decisions. However, existing DFS methods use opaque models, which hinder their applicability in real-life scenarios. DFS also introduces new own sources of uncertainty compared to the static setting, which is also not considered in the existing literature. In this paper, we formalize the additional sources of uncertainty in DFS, and give formulas to estimate them. We also propose novel approach by leveraging a rule-based system as a base classifier for the DFS process, which enhances decision interpretability compared to neural estimators. Finally, we demonstrate the competitive performance of our rule-based DFS approach against established and state-of-the-art greedy and reinforcement learning methods, which are mostly considered opaque, compared to our explainable rulebased system.
