Table of Contents
Fetching ...

Redundancy-optimized Multi-head Attention Networks for Multi-View Multi-Label Feature Selection

Yuzhou Liu, Jiarui Liu, Wanfu Gao

TL;DR

MVML feature selection is challenged by complex inter-view relations and feature redundancy. RMAN-MMFS integrates one attention head per view to model view-self feature-label interactions and employs cross-view attention to capture inter-view complementarity, augmented by static and dynamic redundancy penalties. The framework delivers a unified objective and demonstrates superior performance across six real-world datasets against six baselines, producing compact, discriminative feature subsets with good generalization. The approach offers scalable, cross-view-aware feature selection with practical impact for MVML classification tasks.

Abstract

Multi-view multi-label data offers richer perspectives for artificial intelligence, but simultaneously presents significant challenges for feature selection due to the inherent complexity of interrelations among features, views and labels. Attention mechanisms provide an effective way for analyzing these intricate relationships. They can compute importance weights for information by aggregating correlations between Query and Key matrices to focus on pertinent values. However, existing attention-based feature selection methods predominantly focus on intra-view relationships, neglecting the complementarity of inter-view features and the critical feature-label correlations. Moreover, they often fail to account for feature redundancy, potentially leading to suboptimal feature subsets. To overcome these limitations, we propose a novel method based on Redundancy-optimized Multi-head Attention Networks for Multi-view Multi-label Feature Selection (RMAN-MMFS). Specifically, we employ each individual attention head to model intra-view feature relationships and use the cross-attention mechanisms between different heads to capture inter-view feature complementarity. Furthermore, we design static and dynamic feature redundancy terms: the static term mitigates redundancy within each view, while the dynamic term explicitly models redundancy between unselected and selected features across the entire selection process, thereby promoting feature compactness. Comprehensive evaluations on six real-world datasets, compared against six multi-view multi-label feature selection methods, demonstrate the superior performance of the proposed method.

Redundancy-optimized Multi-head Attention Networks for Multi-View Multi-Label Feature Selection

TL;DR

MVML feature selection is challenged by complex inter-view relations and feature redundancy. RMAN-MMFS integrates one attention head per view to model view-self feature-label interactions and employs cross-view attention to capture inter-view complementarity, augmented by static and dynamic redundancy penalties. The framework delivers a unified objective and demonstrates superior performance across six real-world datasets against six baselines, producing compact, discriminative feature subsets with good generalization. The approach offers scalable, cross-view-aware feature selection with practical impact for MVML classification tasks.

Abstract

Multi-view multi-label data offers richer perspectives for artificial intelligence, but simultaneously presents significant challenges for feature selection due to the inherent complexity of interrelations among features, views and labels. Attention mechanisms provide an effective way for analyzing these intricate relationships. They can compute importance weights for information by aggregating correlations between Query and Key matrices to focus on pertinent values. However, existing attention-based feature selection methods predominantly focus on intra-view relationships, neglecting the complementarity of inter-view features and the critical feature-label correlations. Moreover, they often fail to account for feature redundancy, potentially leading to suboptimal feature subsets. To overcome these limitations, we propose a novel method based on Redundancy-optimized Multi-head Attention Networks for Multi-view Multi-label Feature Selection (RMAN-MMFS). Specifically, we employ each individual attention head to model intra-view feature relationships and use the cross-attention mechanisms between different heads to capture inter-view feature complementarity. Furthermore, we design static and dynamic feature redundancy terms: the static term mitigates redundancy within each view, while the dynamic term explicitly models redundancy between unselected and selected features across the entire selection process, thereby promoting feature compactness. Comprehensive evaluations on six real-world datasets, compared against six multi-view multi-label feature selection methods, demonstrate the superior performance of the proposed method.

Paper Structure

This paper contains 20 sections, 11 equations, 4 figures, 6 tables, 1 algorithm.

Figures (4)

  • Figure 1: Example of relationships in MVML data. ① Feature-label correlations determine which features require attention. ② Inter-view relations may indicate redundant features (all of them are "ear”). ③ Cross-view relations could show complementary features (the eyes are "jade-green”).
  • Figure 2: An illustration of the proposed RMAN-MMFS is presented, showcasing: (1) Multi-head attention for multi-view data analysis, which captures feature correlations and complementarity; (2) Static and dynamic redundancy penalty terms that quantify inter-feature redundancy. Feature weights are subsequently computed through this comprehensive integration process.
  • Figure 3: Seven methods on SCENE in terms of AP, AUC, CE and RL.
  • Figure 4: Parameter sensitivity studies on the SCENE datasets.