2018 Robotic Scene Segmentation Challenge

Max Allan; Satoshi Kondo; Sebastian Bodenstedt; Stefan Leger; Rahim Kadkhodamohammadi; Imanol Luengo; Felix Fuentes; Evangello Flouty; Ahmed Mohammed; Marius Pedersen; Avinash Kori; Varghese Alex; Ganapathy Krishnamurthi; David Rauber; Robert Mendel; Christoph Palm; Sophia Bano; Guinther Saibro; Chi-Sheng Shih; Hsun-An Chiang; Juntang Zhuang; Junlin Yang; Vladimir Iglovikov; Anton Dobrenkii; Madhu Reddiboina; Anubhav Reddy; Xingtong Liu; Cong Gao; Mathias Unberath; Myeonghyeon Kim; Chanho Kim; Chaewon Kim; Hyejin Kim; Gyeongmin Lee; Ihsan Ullah; Miguel Luna; Sang Hyun Park; Mahdi Azizian; Danail Stoyanov; Lena Maier-Hein; Stefanie Speidel

2018 Robotic Scene Segmentation Challenge

Max Allan, Satoshi Kondo, Sebastian Bodenstedt, Stefan Leger, Rahim Kadkhodamohammadi, Imanol Luengo, Felix Fuentes, Evangello Flouty, Ahmed Mohammed, Marius Pedersen, Avinash Kori, Varghese Alex, Ganapathy Krishnamurthi, David Rauber, Robert Mendel, Christoph Palm, Sophia Bano, Guinther Saibro, Chi-Sheng Shih, Hsun-An Chiang, Juntang Zhuang, Junlin Yang, Vladimir Iglovikov, Anton Dobrenkii, Madhu Reddiboina, Anubhav Reddy, Xingtong Liu, Cong Gao, Mathias Unberath, Myeonghyeon Kim, Chanho Kim, Chaewon Kim, Hyejin Kim, Gyeongmin Lee, Ihsan Ullah, Miguel Luna, Sang Hyun Park, Mahdi Azizian, Danail Stoyanov, Lena Maier-Hein, Stefanie Speidel

TL;DR

This paper presents the 2018 EndoVis robotic scene segmentation challenge, expanding beyond prior instrument-focused tasks to include anatomical and medical-device classes using porcine laparoscopic data. It catalogs a diverse set of segmentation approaches (DeepLabV3+, PSPNet, U-Nets, and stereo-aware architectures) and a broad array of training strategies across 19 sequences (15 train, 4 test). The study demonstrates mixed success across classes, with kidney-related structures and heavily occluded surfaces posing the greatest challenges, and identifies Regensburg as a leading performer across multiple datasets. Overall, the work advances surgical scene understanding by benchmarking multi-class, multi-object segmentation in realistic MIS imagery and highlighting avenues for robustness under occlusion, class imbalance, and domain shift, enabling improved AR overlays and decision support in robotic surgery.

Abstract

In 2015 we began a sub-challenge at the EndoVis workshop at MICCAI in Munich using endoscope images of ex-vivo tissue with automatically generated annotations from robot forward kinematics and instrument CAD models. However, the limited background variation and simple motion rendered the dataset uninformative in learning about which techniques would be suitable for segmentation in real surgery. In 2017, at the same workshop in Quebec we introduced the robotic instrument segmentation dataset with 10 teams participating in the challenge to perform binary, articulating parts and type segmentation of da Vinci instruments. This challenge included realistic instrument motion and more complex porcine tissue as background and was widely addressed with modifications on U-Nets and other popular CNN architectures. In 2018 we added to the complexity by introducing a set of anatomical objects and medical devices to the segmented classes. To avoid over-complicating the challenge, we continued with porcine data which is dramatically simpler than human tissue due to the lack of fatty tissue occluding many organs.

2018 Robotic Scene Segmentation Challenge

TL;DR

Abstract

2018 Robotic Scene Segmentation Challenge

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)