Expected Confidence Dependency: A Novel Rough Set-Based Approach to Feature Selection
Saeed Rasouli, Hamid Karamikabir
TL;DR
This work introduces Expected Confidence Dependency (ECD), a soft-computing, probabilistic generalization of rough-set dependency that weights each conditional equivalence class by its classification confidence. ECD computes an overall dependency Exp(C,D) as the normalized sum of block-wise majority confidences, enabling smooth, partial, and uncertainty-aware feature selection. Theoretical guarantees include normalization, monotonicity, and invariance properties, while experiments on four UCI datasets show that ECD-based forward selection yields more accurate and compact feature subsets than classical, relative, or direct dependency criteria. The approach demonstrates robustness to noise and partial consistency, with practical potential across high-dimensional, heterogeneous data domains. Extensions to incomplete data, scalability improvements, and broader domain applications are identified as promising avenues for future work.
Abstract
This paper proposes Expected Confidence Dependency (ECD), a novel, soft computing-oriented, accuracy driven dependency measure for feature selection within the rough set theory framework. Unlike traditional rough set dependency measures that rely on binary characterizations of conditional blocks, ECD assigns confidence-based contributions to individual equivalence blocks and aggregates them through a normalized expectation operator. We formally establish several desirable properties of ECD, including normalization, compatibility with classical dependency, monotonicity, and invariance under structural and label-preserving transformations.
