R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection

Shuaike Shen; Ke Liu; Jiaqing Xie; Shangde Gao; Chunhua Shen; Ge Liu; Mireia Crispin-Ortuzar; Shangqi Gao

R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection

Shuaike Shen, Ke Liu, Jiaqing Xie, Shangde Gao, Chunhua Shen, Ge Liu, Mireia Crispin-Ortuzar, Shangqi Gao

TL;DR

R$^2$-Seg addresses the challenge of out-of-distribution tumor segmentation by combining a training-free, anatomy-aware Reason step with a statistically principled Reject step. An LLM-guided planner localizes organ anchors and generates multi-scale ROIs, constraining a frozen segmentation model to anatomically plausible regions. A nonparametric two-sample test with FDR control (via $MMD^2$ and permutation testing) filters out false positives, significantly improving specificity while preserving sensitivity. The approach avoids catastrophic forgetting by not updating model weights, and achieves strong OOD performance across multi-center, multi-modal benchmarks, enabling safer, deployment-friendly tumor parsing.

Abstract

Foundation models for medical image segmentation struggle under out-of-distribution (OOD) shifts, often producing fragmented false positives on OOD tumors. We introduce R$^{2}$Seg, a training-free framework for robust OOD tumor segmentation that operates via a two-stage Reason-and-Reject process. First, the Reason step employs an LLM-guided anatomical reasoning planner to localize organ anchors and generate multi-scale ROIs. Second, the Reject step applies two-sample statistical testing to candidates generated by a frozen foundation model (BiomedParse) within these ROIs. This statistical rejection filter retains only candidates significantly different from normal tissue, effectively suppressing false positives. Our framework requires no parameter updates, making it compatible with zero-update test-time augmentation and avoiding catastrophic forgetting. On multi-center and multi-modal tumor segmentation benchmarks, R$^{2}$Seg substantially improves Dice, specificity, and sensitivity over strong baselines and the original foundation models. Code are available at https://github.com/Eurekashen/R2Seg.

R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection

TL;DR

-Seg addresses the challenge of out-of-distribution tumor segmentation by combining a training-free, anatomy-aware Reason step with a statistically principled Reject step. An LLM-guided planner localizes organ anchors and generates multi-scale ROIs, constraining a frozen segmentation model to anatomically plausible regions. A nonparametric two-sample test with FDR control (via

and permutation testing) filters out false positives, significantly improving specificity while preserving sensitivity. The approach avoids catastrophic forgetting by not updating model weights, and achieves strong OOD performance across multi-center, multi-modal benchmarks, enabling safer, deployment-friendly tumor parsing.

Abstract

Foundation models for medical image segmentation struggle under out-of-distribution (OOD) shifts, often producing fragmented false positives on OOD tumors. We introduce R

Seg, a training-free framework for robust OOD tumor segmentation that operates via a two-stage Reason-and-Reject process. First, the Reason step employs an LLM-guided anatomical reasoning planner to localize organ anchors and generate multi-scale ROIs. Second, the Reject step applies two-sample statistical testing to candidates generated by a frozen foundation model (BiomedParse) within these ROIs. This statistical rejection filter retains only candidates significantly different from normal tissue, effectively suppressing false positives. Our framework requires no parameter updates, making it compatible with zero-update test-time augmentation and avoiding catastrophic forgetting. On multi-center and multi-modal tumor segmentation benchmarks, R

Seg substantially improves Dice, specificity, and sensitivity over strong baselines and the original foundation models. Code are available at https://github.com/Eurekashen/R2Seg.

R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection

TL;DR

Abstract

R$^{2}$Seg: Training-Free OOD Medical Tumor Segmentation via Anatomical Reasoning and Statistical Rejection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)