Table of Contents
Fetching ...

SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational Cytology

Marco Acerbis, Swarnadip Chatterjee, Christophe Avenel, Joakim Lindblad

TL;DR

This work tackles learning from weak slide-level labels in computational cytology where witness rates are extremely low. It introduces SLAM-AGS, a slide-label aware multitask pretraining framework that combines a Similarity loss for slide-negative patches and a SimCLR-based contrastive loss for slide-positive patches, augmented by Adaptive Gradient Surgery to mitigate gradient conflicts. On a bone-marrow cytology dataset with simulated witness rates, SLAM-AGS yields superior bag-level F1 and Top-400 key-patch retrieval relative to weakly supervised and self-supervised baselines, with the largest gains at the lowest witness rates. The approach is released openly to facilitate reproducibility and demonstrates that managing gradient interference is crucial for stable, label-efficient pretraining in multi-task MIL settings.

Abstract

Computational cytology faces two major challenges: i) instance-level labels are unreliable and prohibitively costly to obtain, ii) witness rates are extremely low. We propose SLAM-AGS, a Slide-Label-Aware Multitask pretraining framework that jointly optimizes (i) a weakly supervised similarity objective on slide-negative patches and (ii) a self-supervised contrastive objective on slide-positive patches, yielding stronger performance on downstream tasks. To stabilize learning, we apply Adaptive Gradient Surgery to tackle conflicting task gradients and prevent model collapse. We integrate the pretrained encoder into an attention-based Multiple Instance Learning aggregator for bag-level prediction and attention-guided retrieval of the most abnormal instances in a bag. On a publicly available bone-marrow cytology dataset, with simulated witness rates from 10% down to 0.5%, SLAM-AGS improves bag-level F1-Score and Top 400 positive cell retrieval over other pretraining methods, with the largest gains at low witness rates, showing that resolving gradient interference enables stable pretraining and better performance on downstream tasks. To facilitate reproducibility, we share our complete implementation and evaluation framework as open source: https://github.com/Ace95/SLAM-AGS.

SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational Cytology

TL;DR

This work tackles learning from weak slide-level labels in computational cytology where witness rates are extremely low. It introduces SLAM-AGS, a slide-label aware multitask pretraining framework that combines a Similarity loss for slide-negative patches and a SimCLR-based contrastive loss for slide-positive patches, augmented by Adaptive Gradient Surgery to mitigate gradient conflicts. On a bone-marrow cytology dataset with simulated witness rates, SLAM-AGS yields superior bag-level F1 and Top-400 key-patch retrieval relative to weakly supervised and self-supervised baselines, with the largest gains at the lowest witness rates. The approach is released openly to facilitate reproducibility and demonstrates that managing gradient interference is crucial for stable, label-efficient pretraining in multi-task MIL settings.

Abstract

Computational cytology faces two major challenges: i) instance-level labels are unreliable and prohibitively costly to obtain, ii) witness rates are extremely low. We propose SLAM-AGS, a Slide-Label-Aware Multitask pretraining framework that jointly optimizes (i) a weakly supervised similarity objective on slide-negative patches and (ii) a self-supervised contrastive objective on slide-positive patches, yielding stronger performance on downstream tasks. To stabilize learning, we apply Adaptive Gradient Surgery to tackle conflicting task gradients and prevent model collapse. We integrate the pretrained encoder into an attention-based Multiple Instance Learning aggregator for bag-level prediction and attention-guided retrieval of the most abnormal instances in a bag. On a publicly available bone-marrow cytology dataset, with simulated witness rates from 10% down to 0.5%, SLAM-AGS improves bag-level F1-Score and Top 400 positive cell retrieval over other pretraining methods, with the largest gains at low witness rates, showing that resolving gradient interference enables stable pretraining and better performance on downstream tasks. To facilitate reproducibility, we share our complete implementation and evaluation framework as open source: https://github.com/Ace95/SLAM-AGS.

Paper Structure

This paper contains 13 sections, 6 equations, 2 figures, 2 tables.

Figures (2)

  • Figure 1: (a) F1-scores for the bag classification task across decreasing witness rates. (b) Recall at Top 400 patches (corresponding to top 40% of instances) of positive bags.
  • Figure 2: Examples of images extracted from the Bone Marrow Cytology dataset: (a) lymphocyte, (b) hairy cell, (c) promyelocyte, (d) basophil.