PathBench-MIL: A Comprehensive AutoML and Benchmarking Framework for Multiple Instance Learning in Histopathology
Siemen Brussee, Pieter A. Valkema, Jurre A. J. Weijer, Thom Doeleman, Anne M. R. Schrader, Jesper Kers
TL;DR
The paper tackles the challenge of evaluating and optimizing end-to-end MIL pipelines for histopathology WSIs under weak supervision. It introduces PathBench-MIL, an open-source framework built atop SlideFlow that supports end-to-end MIL workflows, including preprocessing, tiling, feature extraction, aggregation, and AutoML-based pipeline search. Key contributions include an Optuna-based AutoML engine with budget-aware pruning, a unified YAML configuration for reproducibility, and an interactive Dash visualization tool for exploring results. The framework enables dataset-aware, systematic comparisons across MIL configurations and tasks (classification, regression, survival), aiming to standardize evaluation practices in pathology AI.
Abstract
We introduce PathBench-MIL, an open-source AutoML and benchmarking framework for multiple instance learning (MIL) in histopathology. The system automates end-to-end MIL pipeline construction, including preprocessing, feature extraction, and MIL-aggregation, and provides reproducible benchmarking of dozens of MIL models and feature extractors. PathBench-MIL integrates visualization tooling, a unified configuration system, and modular extensibility, enabling rapid experimentation and standardization across datasets and tasks. PathBench-MIL is publicly available at https://github.com/Sbrussee/PathBench-MIL
