Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification

Hexin Dong; Yi Lin; Pengyu Zhou; Fengnian Zhao; Alan Clint Legasto; Mingquan Lin; Hao Chen; Yuzhe Yang; George Shih; Yifan Peng

Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification

Hexin Dong, Yi Lin, Pengyu Zhou, Fengnian Zhao, Alan Clint Legasto, Mingquan Lin, Hao Chen, Yuzhe Yang, George Shih, Yifan Peng

TL;DR

The CXR-LT 2026 challenge introduces a multi-center dataset comprising over 145,000 images from PadChest and NIH Chest X-ray datasets and demonstrates that large-scale vision-language pre-training significantly mitigates the performance drop typically associated with zero-shot diagnosis.

Abstract

Chest X-ray (CXR) interpretation is hindered by the long-tailed distribution of pathologies and the open-world nature of clinical environments. Existing benchmarks often rely on closed-set classes from single institutions, failing to capture the prevalence of rare diseases or the appearance of novel findings. To address this, we present the CXR-LT 2026 challenge. This third iteration of the benchmark introduces a multi-center dataset comprising over 145,000 images from PadChest and NIH Chest X-ray datasets. The challenge defines two core tasks: (1) Robust Multi-Label Classification on 30 known classes and (2) Open-World Generalization to 6 unseen (out-of-distribution) rare disease classes. We report the results of the top-performing teams, evaluating them via mean Average Precision (mAP), AUROC, and F1-score. The winning solutions achieved an mAP of 0.5854 on Task 1 and 0.4315 on Task 2, demonstrating that large-scale vision-language pre-training significantly mitigates the performance drop typically associated with zero-shot diagnosis.

Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification

TL;DR

Abstract

Paper Structure (17 sections, 1 figure, 3 tables)

This paper contains 17 sections, 1 figure, 3 tables.

Introduction
Materials and Methods
Dataset Collection
Label Collection and Task Definition
Task 1: Robust Multi-Label Classification
Task 2: Open-World Generalization
Evaluation Metrics
Challenge procedure
Results
Team participation across stages
Task 1: Robust Prediction Results
Task 2: Open-World / OOD Multi-Label Classification
Discussion
The Dominance of Foundation Models
Calibration vs. Discrimination
...and 2 more sections

Figures (1)

Figure 1: Per-class label distribution in the CXR-LT 2026 evaluation sets (development + test).

Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification

TL;DR

Abstract

Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification

Authors

TL;DR

Abstract

Table of Contents

Figures (1)