Robot-Enabled Machine Learning-Based Diagnosis of Gastric Cancer Polyps Using Partial Surface Tactile Imaging
Siddhartha Kapuria, Jeff Bonyun, Yash Kulkarni, Naruhiko Ikoma, Sandeep Chinchali, Farshid Alambeigi
TL;DR
This work tackles the challenge of endoscopic gastric cancer diagnosis under data scarcity by introducing a Vision-based Tactile Sensor (HySenSe) and a robot-assisted data collection pipeline to capture partial surface textures from realistic AGC phantoms. A Dilated ResNet model, trained on a large, synthetically generated tactile texture dataset, achieves high macro-averaged performance (e.g., AUC ≈ $0.999$, Accuracy ≈ $0.967$) and demonstrates robustness across mixed morphologies and partial contact. The approach addresses data distribution biases and the limitations of endoscopic imaging, showing promise for texture-based AGC polyp classification and guiding future clinical validation and Sim2Real transfer. Overall, the combination of VTS, robotic sampling, and a lightweight yet effective CNN architecture provides a feasible path toward texture-aware diagnostic assistance for AGC lesions. "$4\,\text{cm}^2$ coverage, $50\,\mu\text{m}$ texture resolution, and $3\text{ N}$ interaction cap" are among the concrete constraints that the study successfully manages to enable robust learning from partial observations.
Abstract
In this paper, to collectively address the existing limitations on endoscopic diagnosis of Advanced Gastric Cancer (AGC) Tumors, for the first time, we propose (i) utilization and evaluation of our recently developed Vision-based Tactile Sensor (VTS), and (ii) a complementary Machine Learning (ML) algorithm for classifying tumors using their textural features. Leveraging a seven DoF robotic manipulator and unique custom-designed and additively-manufactured realistic AGC tumor phantoms, we demonstrated the advantages of automated data collection using the VTS addressing the problem of data scarcity and biases encountered in traditional ML-based approaches. Our synthetic-data-trained ML model was successfully evaluated and compared with traditional ML models utilizing various statistical metrics even under mixed morphological characteristics and partial sensor contact.
