MLRS-PDS: A Meta-learning recommendation of dynamic ensemble selection pipelines
Hesam Jalalian, Rafael M. O. Cruz
TL;DR
Dynamic Selection (DS) performance hinges on the chosen pool of classifiers and the DS method, motivating automated, dataset-aware pipeline design. The authors introduce MLRS, a multi-label meta-learning framework that maps dataset meta-features to preferred DS configurations, with three variants: MLRS-P, MLRS-DS, and MLRS-PDS, including a chained approach for full automation. Trained on a meta-dataset derived from $129$ meta-features and evaluated on $288$ datasets, MLRS variants outperform fixed-pool and fixed-DS baselines, with MLRS-PDS delivering the strongest gains by jointly selecting the pool and DS. The work demonstrates the practical value of meta-learning in AutoML-like DS pipeline design, enabling efficient, dataset-specific DS configurations without exhaustive search.
Abstract
Dynamic Selection (DS), where base classifiers are chosen from a classifier's pool for each new instance at test time, has shown to be highly effective in pattern recognition. However, instability and redundancy in the classifier pools can impede computational efficiency and accuracy in dynamic ensemble selection. This paper introduces a meta-learning recommendation system (MLRS) to recommend the optimal pool generation scheme for DES methods tailored to individual datasets. The system employs a meta-model built from dataset meta-features to predict the most suitable pool generation scheme and DES method for a given dataset. Through an extensive experimental study encompassing 288 datasets, we demonstrate that this meta-learning recommendation system outperforms traditional fixed pool or DES method selection strategies, highlighting the efficacy of a meta-learning approach in refining DES method selection. The source code, datasets, and supplementary results can be found in this project's GitHub repository: https://github.com/Menelau/MLRS-PDS.
