Explaining AutoClustering: Uncovering Meta-Feature Contribution in AutoML for Clustering
Matheus Camilo da Silva, Leonardo Arrighi, Ana Carolina Lorena, Sylvio Barbon Junior
TL;DR
This work tackles the lack of transparency in AutoClustering by delivering a unified taxonomy of meta-features across 22 frameworks and a two-level explainability approach. Global explanations via Decision Predicate Graphs reveal which meta-features structurally drive meta-model decisions, while local SHAP attributions illuminate instance-level feature contributions to specific clustering recommendations. An explainability-driven ablation demonstrates that a small core of meta-features suffices for most predictive power, enabling large reductions in feature extraction cost with limited accuracy loss. Together, these findings provide practical guidelines for building more interpretable and cost-efficient AutoML systems for clustering, including bias diagnostics and improved meta-feature engineering. The work lays groundwork for auditing and refining auto clustering pipelines, with implications for reliability and deployment in real-world unsupervised learning tasks.
Abstract
AutoClustering methods aim to automate unsupervised learning tasks, including algorithm selection (AS), hyperparameter optimization (HPO), and pipeline synthesis (PS), by often leveraging meta-learning over dataset meta-features. While these systems often achieve strong performance, their recommendations are often difficult to justify: the influence of dataset meta-features on algorithm and hyperparameter choices is typically not exposed, limiting reliability, bias diagnostics, and efficient meta-feature engineering. This limits reliability and diagnostic insight for further improvements. In this work, we investigate the explainability of the meta-models in AutoClustering. We first review 22 existing methods and organize their meta-features into a structured taxonomy. We then apply a global explainability technique (i.e., Decision Predicate Graphs) to assess feature importance within meta-models from selected frameworks. Finally, we use local explainability tools such as SHAP (SHapley Additive exPlanations) to analyse specific clustering decisions. Our findings highlight consistent patterns in meta-feature relevance, identify structural weaknesses in current meta-learning strategies that can distort recommendations, and provide actionable guidance for more interpretable Automated Machine Learning (AutoML) design. This study therefore offers a practical foundation for increasing decision transparency in unsupervised learning automation.
