Semantic-Inductive Attribute Selection for Zero-Shot Learning
Juan Jose Herrera-Aranda, Guillermo Gomez-Trenado, Francisco Herrera, Isaac Triguero
TL;DR
This work tackles the difficulty of inductive zero-shot learning (ZSL) arising from noisy and redundant semantic attributes. It introduces a class-stratified cross-validation partitioning to simulate unseen conditions using only seen data, enabling robust evaluation of attribute selection. Two complementary strategies are proposed: a rank-based embedded feature selection (RFS) with cross-validated consensus, and a genetic algorithm (GA) that globally searches attribute subsets with fitness tied to pseudo-unseen performance. Across five diverse benchmarks, both methods improve unseen accuracy over the baseline SAE, with RFS offering efficiency and GA providing broader space exploration at higher cost. The findings highlight semantic-space redundancy and demonstrate that systematic attribute refinement can enhance generalization in open-world AI tasks.
Abstract
Zero-Shot Learning is an important paradigm within General-Purpose Artificial Intelligence Systems, particularly in those that operate in open-world scenarios where systems must adapt to new tasks dynamically. Semantic spaces play a pivotal role as they bridge seen and unseen classes, but whether human-annotated or generated by a machine learning model, they often contain noisy, redundant, or irrelevant attributes that hinder performance. To address this, we introduce a partitioning scheme that simulates unseen conditions in an inductive setting (which is the most challenging), allowing attribute relevance to be assessed without access to semantic information from unseen classes. Within this framework, we study two complementary feature-selection strategies and assess their generalisation. The first adapts embedded feature selection to the particular demands of ZSL, turning model-driven rankings into meaningful semantic pruning; the second leverages evolutionary computation to directly explore the space of attribute subsets more broadly. Experiments on five benchmark datasets (AWA2, CUB, SUN, aPY, FLO) show that both methods consistently improve accuracy on unseen classes by reducing redundancy, but in complementary ways: RFS is efficient and competitive though dependent on critical hyperparameters, whereas GA is more costly yet explores the search space more broadly and avoids such dependence. These results confirm that semantic spaces are inherently redundant and highlight the proposed partitioning scheme as an effective tool to refine them under inductive conditions.
