DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery
Yuning Zhou, Henry Badgery, Matthew Read, James Bailey, Catherine E. Davey
TL;DR
The paper addresses domain-specific augmentation selection for contrastive self-supervised learning in medical imaging, focusing on laparoscopic surgery. It introduces Dimensionality Driven Augmentation Search ($DDA$), which differentiably optimizes augmentation policies by maximizing the local intrinsic dimensionality ($LID$) of representations, using a proxy objective that does not require downstream labels. Empirically, $DDA$ yields consistent improvements over standard SimCLR and SelfAugment baselines on linear evaluation and downstream segmentation across SVHM and Cholec80, and reveals that color-based augmentations like hue are not advantageous for laparoscopic imagery. The method is computationally efficient, navigating large policy spaces in hours and providing domain-relevant insights into effective augmentations for medical SSL.
Abstract
Self-supervised learning (SSL) has potential for effective representation learning in medical imaging, but the choice of data augmentation is critical and domain-specific. It remains uncertain if general augmentation policies suit surgical applications. In this work, we automate the search for suitable augmentation policies through a new method called Dimensionality Driven Augmentation Search (DDA). DDA leverages the local dimensionality of deep representations as a proxy target, and differentiably searches for suitable data augmentation policies in contrastive learning. We demonstrate the effectiveness and efficiency of DDA in navigating a large search space and successfully identifying an appropriate data augmentation policy for laparoscopic surgery. We systematically evaluate DDA across three laparoscopic image classification and segmentation tasks, where it significantly improves over existing baselines. Furthermore, DDA's optimised set of augmentations provides insight into domain-specific dependencies when applying contrastive learning in medical applications. For example, while hue is an effective augmentation for natural images, it is not advantageous for laparoscopic images.
