Automated Machine Learning for Multi-Label Classification
Marcel Wever
TL;DR
This work addresses the challenge of automating multi-label classification (MLC) within AutoML, focusing on the enormous, hierarchical search space that emerges when configuring MLC methods. It introduces ML-Plan, a hierarchical task network (HTN) planning framework for AutoML, first for short pipelines and then extended to unlimited-length pipelines and to MLC, enabling scalable, structured exploration of complex configurations. The thesis further extends the approach with methods like LiBRe for label-wise base-learner selection, ensembles of nested dichotomies, and a runtime predictor to improve efficiency, complemented by empirical evaluations that compare against state-of-the-art AutoML approaches. The Open-Ended discussion includes practical implications for On-The-Fly Computing and on-demand ML services, highlighting both methodological advances and remaining questions on search-space management, pruning, and runtime-aware optimization in real-world deployments.
Abstract
Automated machine learning (AutoML) aims to select and configure machine learning algorithms and combine them into machine learning pipelines tailored to a dataset at hand. For supervised learning tasks, most notably binary and multinomial classification, aka single-label classification (SLC), such AutoML approaches have shown promising results. However, the task of multi-label classification (MLC), where data points are associated with a set of class labels instead of a single class label, has received much less attention so far. In the context of multi-label classification, the data-specific selection and configuration of multi-label classifiers are challenging even for experts in the field, as it is a high-dimensional optimization problem with multi-level hierarchical dependencies. While for SLC, the space of machine learning pipelines is already huge, the size of the MLC search space outnumbers the one of SLC by several orders. In the first part of this thesis, we devise a novel AutoML approach for single-label classification tasks optimizing pipelines of machine learning algorithms, consisting of two algorithms at most. This approach is then extended first to optimize pipelines of unlimited length and eventually configure the complex hierarchical structures of multi-label classification methods. Furthermore, we investigate how well AutoML approaches that form the state of the art for single-label classification tasks scale with the increased problem complexity of AutoML for multi-label classification. In the second part, we explore how methods for SLC and MLC could be configured more flexibly to achieve better generalization performance and how to increase the efficiency of execution-based AutoML systems.
