An Ordinal Regression Framework for a Deep Learning Based Severity Assessment for Chest Radiographs
Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn
TL;DR
The paper tackles ordinal severity scoring in chest radiographs by introducing a modular ordinal-regression framework that separates the task into a model, a target function, and a classification function. It systematically compares multiple target encodings (One-Hot, Gaussian, Continuous, Progress-Bar, Soft-Progress-Bar, Binary) using ResNet50 and ViT-B-16, evaluated with unweighted and weighted Cohen's kappa ($kappa$) metrics. Key findings show no universally optimal method across metrics or architectures; One-Hot excels for unweighted kappa, while encodings that reflect ordinal structure (Gaussian, Progress-Bar, Soft-Progress-Bar) often perform better for weighted kappas. The work provides practical guidance for selecting encodings and weights in clinical settings and offers code for reproducible evaluation of ordinal regression in medical imaging.
Abstract
This study investigates the application of ordinal regression methods for categorizing disease severity in chest radiographs. We propose a framework that divides the ordinal regression problem into three parts: a model, a target function, and a classification function. Different encoding methods, including one-hot, Gaussian, progress-bar, and our soft-progress-bar, are applied using ResNet50 and ViT-B-16 deep learning models. We show that the choice of encoding has a strong impact on performance and that the best encoding depends on the chosen weighting of Cohen's kappa and also on the model architecture used. We make our code publicly available on GitHub.
