FINER: Enhancing State-of-the-art Classifiers with Feature Attribution to Facilitate Security Analysis
Yiling He, Jian Lou, Zhan Qin, Kui Ren
TL;DR
FINER addresses the opacity of state-of-the-art risk detectors by introducing an explainable risk detection system that couples a high-fidelity, task-aware explanatory framework with data-driven classifiers. It achieves this through explanation-guided model updating (multi-task learning with explanation-driven data augmentation) and task-aware explanation generation (domain-adjusted FA methods with IC-based ensemble), guided by a formal IC abstraction layer. Extensive evaluations across Android malware, Windows malware, and vulnerability detection tasks show that FINER improves explanation fidelity substantially without harming accuracy, and enables function-level malware localization that outperforms a state-of-the-art tool. The framework’s staged architecture and open-source availability make it practical for real-world security analysis and adaptable to additional risk domains.
Abstract
Deep learning classifiers achieve state-of-the-art performance in various risk detection applications. They explore rich semantic representations and are supposed to automatically discover risk behaviors. However, due to the lack of transparency, the behavioral semantics cannot be conveyed to downstream security experts to reduce their heavy workload in security analysis. Although feature attribution (FA) methods can be used to explain deep learning, the underlying classifier is still blind to what behavior is suspicious, and the generated explanation cannot adapt to downstream tasks, incurring poor explanation fidelity and intelligibility. In this paper, we propose FINER, the first framework for risk detection classifiers to generate high-fidelity and high-intelligibility explanations. The high-level idea is to gather explanation efforts from model developer, FA designer, and security experts. To improve fidelity, we fine-tune the classifier with an explanation-guided multi-task learning strategy. To improve intelligibility, we engage task knowledge to adjust and ensemble FA methods. Extensive evaluations show that FINER improves explanation quality for risk detection. Moreover, we demonstrate that FINER outperforms a state-of-the-art tool in facilitating malware analysis.
