Table of Contents
Fetching ...

MetaChest: Generalized few-shot learning of pathologies from chest X-rays

Berenice Montalvo-Lezama, Gibran Fuentes-Pineda

TL;DR

The paper tackles the scarcity of annotated chest X-ray data by adopting generalized few-shot learning to classify multiple pathologies with both seen and unseen labels. It introduces MetaChest, a large, multi-source chest X-ray dataset with a dedicated meta-learning partition and a multi-label episode generator, enabling robust GFSL evaluation. Through experiments comparing standard transfer learning (BatchBased) and a multi-label ProtoNet extension (ProtoNet-ML), the study shows that higher episode complexity (more classes per episode) and more per-class examples improve performance, with transfer learning approaches often outperforming meta-learners in this domain. The work highlights the importance of higher-resolution images and efficient architectures for medical tasks, and suggests practical deployment benefits and directions for future research, including vision foundation models and multimodal integration.

Abstract

The limited availability of annotated data presents a major challenge for applying deep learning methods to medical image analysis. Few-shot learning methods aim to recognize new classes from only a small number of labeled examples. These methods are typically studied under the standard few-shot learning setting, where all classes in a task are new. However, medical applications such as pathology classification from chest X-rays often require learning new classes while simultaneously leveraging knowledge of previously known ones, a scenario more closely aligned with generalized few-shot classification. Despite its practical relevance, few-shot learning has been scarcely studied in this context. In this work, we present MetaChest, a large-scale dataset of 479,215 chest X-rays collected from four public databases. MetaChest includes a meta-set partition specifically designed for standard few-shot classification, as well as an algorithm for generating multi-label episodes. We conduct extensive experiments evaluating both a standard transfer learning approach and an extension of ProtoNet across a wide range of few-shot multi-label classification tasks. Our results demonstrate that increasing the number of classes per episode and the number of training examples per class improves classification performance. Notably, the transfer learning approach consistently outperforms the ProtoNet extension, despite not being tailored for few-shot learning. We also show that higher-resolution images improve accuracy at the cost of additional computation, while efficient model architectures achieve comparable performance to larger models with significantly reduced resource requirements.

MetaChest: Generalized few-shot learning of pathologies from chest X-rays

TL;DR

The paper tackles the scarcity of annotated chest X-ray data by adopting generalized few-shot learning to classify multiple pathologies with both seen and unseen labels. It introduces MetaChest, a large, multi-source chest X-ray dataset with a dedicated meta-learning partition and a multi-label episode generator, enabling robust GFSL evaluation. Through experiments comparing standard transfer learning (BatchBased) and a multi-label ProtoNet extension (ProtoNet-ML), the study shows that higher episode complexity (more classes per episode) and more per-class examples improve performance, with transfer learning approaches often outperforming meta-learners in this domain. The work highlights the importance of higher-resolution images and efficient architectures for medical tasks, and suggests practical deployment benefits and directions for future research, including vision foundation models and multimodal integration.

Abstract

The limited availability of annotated data presents a major challenge for applying deep learning methods to medical image analysis. Few-shot learning methods aim to recognize new classes from only a small number of labeled examples. These methods are typically studied under the standard few-shot learning setting, where all classes in a task are new. However, medical applications such as pathology classification from chest X-rays often require learning new classes while simultaneously leveraging knowledge of previously known ones, a scenario more closely aligned with generalized few-shot classification. Despite its practical relevance, few-shot learning has been scarcely studied in this context. In this work, we present MetaChest, a large-scale dataset of 479,215 chest X-rays collected from four public databases. MetaChest includes a meta-set partition specifically designed for standard few-shot classification, as well as an algorithm for generating multi-label episodes. We conduct extensive experiments evaluating both a standard transfer learning approach and an extension of ProtoNet across a wide range of few-shot multi-label classification tasks. Our results demonstrate that increasing the number of classes per episode and the number of training examples per class improves classification performance. Notably, the transfer learning approach consistently outperforms the ProtoNet extension, despite not being tailored for few-shot learning. We also show that higher-resolution images improve accuracy at the cost of additional computation, while efficient model architectures achieve comparable performance to larger models with significantly reduced resource requirements.

Paper Structure

This paper contains 32 sections, 5 equations, 9 figures, 9 tables, 1 algorithm.

Figures (9)

  • Figure 1: Distribution of labels per pathology and dataset in MetaChest.
  • Figure 2: Co-occurrence matrix of MetaChest pathologies.
  • Figure 3: Venn diagram illustrating the relationship between meta-training $\mathcal{C}_{meta-trn}$, meta-validation $\mathcal{C}_{meta-val}$, and meta-test $\mathcal{C}_{meta-tst}$ class sets. Blue indicates examples in $\mathcal{D}_{meta-trn}$, pink in $\mathcal{D}_{meta-val}$, and purple in $\mathcal{D}_{meta-tst}$. Since these sets are disjoint and episodes are generated from only one set at a time, the meta-validation and the meta-test episodes contain examples that are not used during meta-training.
  • Figure 4: Representation subspaces for multi-class ProtoNet (left) and ProtoNet-ML (right). In multi-class ProtoNet, a representation vector $g_{\phi}(\mathbf{x})$ is associated with only one prototype, whereas in ProtoNet-ML, a representation vector $f_{\phi}(\mathbf{x})$ may be associated with one or more prototypes. Note that $f_{\phi}(\cdot)$ is the network backbone followed by an encoding layer to reduce representation vector dimensionality.
  • Figure 5: Performance of BatchBased (BB) and ProtoNet-ML (PN) on pathology classification tasks across varying $n$-way, $n$-unseen, and $k$-shot configurations.
  • ...and 4 more figures