2017 Robotic Instrument Segmentation Challenge
Max Allan, Alex Shvets, Thomas Kurmann, Zichen Zhang, Rahul Duggal, Yun-Hsuan Su, Nicola Rieke, Iro Laina, Niveditha Kalavakonda, Sebastian Bodenstedt, Luis Herrera, Wenqi Li, Vladimir Iglovikov, Huoling Luo, Jian Yang, Danail Stoyanov, Lena Maier-Hein, Stefanie Speidel, Mahdi Azizian
TL;DR
The study presents the 2017 Robotic Instrument Segmentation Challenge, delivering a hand-labeled porcine nephrectomy dataset for three segmentation tasks (binary, parts, and type) and evaluating ten teams across mean IoU metrics. A range of architectures—from residual CNNs and cascaded FCNs to TernausNet and ToolNet—are explored, with data augmentation and specialized preprocessing playing key roles. MIT consistently achieved top performance across binary, parts, and many type segmentation datasets, highlighting the value of pre-trained encoders and multi-task training. The work also discusses limitations of a small dataset and labeling challenges, and points to 2018 efforts to provide denser tissue annotations to push toward more comprehensive surgical scene understanding.
Abstract
In mainstream computer vision and machine learning, public datasets such as ImageNet, COCO and KITTI have helped drive enormous improvements by enabling researchers to understand the strengths and limitations of different algorithms via performance comparison. However, this type of approach has had limited translation to problems in robotic assisted surgery as this field has never established the same level of common datasets and benchmarking methods. In 2015 a sub-challenge was introduced at the EndoVis workshop where a set of robotic images were provided with automatically generated annotations from robot forward kinematics. However, there were issues with this dataset due to the limited background variation, lack of complex motion and inaccuracies in the annotation. In this work we present the results of the 2017 challenge on robotic instrument segmentation which involved 10 teams participating in binary, parts and type based segmentation of articulated da Vinci robotic instruments.
