Table of Contents
Fetching ...

Proportion Estimation by Masked Learning from Label Proportion

Takumi Okuo, Kazuya Nishimura, Hiroaki Ito, Kazuhiro Terada, Akihiko Yoshizawa, Ryoma Bise

TL;DR

This paper tackles automatic estimation of the PD-L1 tumor proportion, defined as $r=\frac{N_{PD-L1+\,tumor}}{N_{tumor}}$, from pathological core images with limited cell-level annotations. It introduces a two-stage framework that first detects tumor cells to produce a tumor mask $M$, then estimates $\hat{r}$ from masked tumor maps using $s_p=\sum (F_p\odot M)$, $s_n=\sum (F_n\odot M)$ and $\hat{r}=\frac{s_p}{s_p+s_n}$, supervised by three annotation types (cell positions, tumor region, and proportion labels) and optimized with a weighted focal proportion loss. This loss extends LLP to handle interval-based proportion labels and data imbalance by weighting and focusing on hard intervals, with a hyperparameter $\gamma$ that can vary by proportion interval. Empirical results on clinical datasets show state-of-the-art accuracy and provide interpretable CAM-like maps that help clinicians understand the PD-L1 estimation while reducing annotation burden.

Abstract

The PD-L1 rate, the number of PD-L1 positive tumor cells over the total number of all tumor cells, is an important metric for immunotherapy. This metric is recorded as diagnostic information with pathological images. In this paper, we propose a proportion estimation method with a small amount of cell-level annotation and proportion annotation, which can be easily collected. Since the PD-L1 rate is calculated from only `tumor cells' and not using `non-tumor cells', we first detect tumor cells with a detection model. Then, we estimate the PD-L1 proportion by introducing a masking technique to `learning from label proportion.' In addition, we propose a weighted focal proportion loss to address data imbalance problems. Experiments using clinical data demonstrate the effectiveness of our method. Our method achieved the best performance in comparisons.

Proportion Estimation by Masked Learning from Label Proportion

TL;DR

This paper tackles automatic estimation of the PD-L1 tumor proportion, defined as , from pathological core images with limited cell-level annotations. It introduces a two-stage framework that first detects tumor cells to produce a tumor mask , then estimates from masked tumor maps using , and , supervised by three annotation types (cell positions, tumor region, and proportion labels) and optimized with a weighted focal proportion loss. This loss extends LLP to handle interval-based proportion labels and data imbalance by weighting and focusing on hard intervals, with a hyperparameter that can vary by proportion interval. Empirical results on clinical datasets show state-of-the-art accuracy and provide interpretable CAM-like maps that help clinicians understand the PD-L1 estimation while reducing annotation burden.

Abstract

The PD-L1 rate, the number of PD-L1 positive tumor cells over the total number of all tumor cells, is an important metric for immunotherapy. This metric is recorded as diagnostic information with pathological images. In this paper, we propose a proportion estimation method with a small amount of cell-level annotation and proportion annotation, which can be easily collected. Since the PD-L1 rate is calculated from only `tumor cells' and not using `non-tumor cells', we first detect tumor cells with a detection model. Then, we estimate the PD-L1 proportion by introducing a masking technique to `learning from label proportion.' In addition, we propose a weighted focal proportion loss to address data imbalance problems. Experiments using clinical data demonstrate the effectiveness of our method. Our method achieved the best performance in comparisons.
Paper Structure (4 sections, 1 equation, 5 figures, 1 table)

This paper contains 4 sections, 1 equation, 5 figures, 1 table.

Figures (5)

  • Figure 1: Illustration of a core image in PD-L1 test. Blue dots regions indicate negative tumors, and red indicates positive tumors. Ideally, the proportion of positive tumor cells is calculated by counting and classifying all cells; however, pathologists roughly estimate the ratio subjective without counting or segmenting.
  • Figure 2: Overview of proposed method. Top: cell detection network, and Bottom: proportion estimation network.
  • Figure 3: Left: focal proportion loss. Right: weighted focal proportion loss.
  • Figure 4: Example of core images in cases of (a) 0 to 1%, (b) 1 to 25%, (c) 25 to 50%, (d) 50 to 75%, (e) 75 to 100%.
  • Figure 5: Visualization by intermediate outputs and CAM-based method zhou2016learning. Red indicates a positive, and blue indicates a negative.