Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

Lokesha Rasanjalee; Jin Lin Tan; Dileepa Pitawela; Rajvinder Singh; Hsiang-Ting Chen

Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

Lokesha Rasanjalee, Jin Lin Tan, Dileepa Pitawela, Rajvinder Singh, Hsiang-Ting Chen

TL;DR

This work systematically study how annotation errors propagate across different prompt types, namely masks, boxes, and points, and proposes Learning-to-Re-Prompt (L2RP), a cost-aware framework that learns when and where to seek expert input.

Abstract

Accurate annotation of endoscopic videos is essential yet time-consuming, particularly for challenging datasets such as dysplasia in Barrett's esophagus, where the affected regions are irregular and lack clear boundaries. Semi-automatic tools like Segment Anything Model 2 (SAM2) can ease this process by propagating annotations across frames, but small errors often accumulate and reduce accuracy, requiring expert review and correction. To address this, we systematically study how annotation errors propagate across different prompt types, namely masks, boxes, and points, and propose Learning-to-Re-Prompt (L2RP), a cost-aware framework that learns when and where to seek expert input. By tuning a human-cost parameter, our method balances annotation effort and segmentation accuracy. Experiments on a private Barrett's dysplasia dataset and the public SUN-SEG benchmark demonstrate improved temporal consistency and superior performance over baseline strategies.

Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

TL;DR

Abstract

Paper Structure (13 sections, 3 equations, 2 figures, 1 table)

This paper contains 13 sections, 3 equations, 2 figures, 1 table.

Introduction
Related Work
Method
Preliminaries
Learning-to-Re-Prompt
Experiment Setting
Results and Discussion
Difference in Prompt Types and Error Propagation
Efficient Expert Intervention through L2RP
Clinical Interpretation and Practical Use of $\lambda_{\text{corr}}$
Conclusion
Compliance with Ethical Standards
Acknowledgments

Figures (2)

Figure 1: Example Barrett’s video showing mask propagation from the initial prompt and after an L2RP-suggested correction, with improved annotation quality.
Figure 2: Mean Dice Loss averaged across all videos at each frame, showing how segmentation error changes over time for different prompt types. Smoothed (20-frame window). Shaded regions indicate 95% confidence intervals.

Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

TL;DR

Abstract

Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

Authors

TL;DR

Abstract

Table of Contents

Figures (2)