Theoretical Data-Driven MobilePosenet: Lightweight Neural Network for Accurate Calibration-Free 5-DOF Magnet Localization
Wenxuan Xie, Yuelin Zhang, Jiwei Shan, Hongzhe Sun, Jiewen Tan, Shing Shin Cheng
TL;DR
This work tackles real-time, calibration-free 5-DOF magnet localization for wireless capsule endoscopy by replacing data-intensive real-world training with theoretical-data–driven learning. It introduces MobilePosenet, a lightweight end-to-end network built on inverted residual attention blocks and depthwise separable convolutions, augmented with sensor-coordinate inputs and random noise to bridge theory and practice. Trained solely on the magnetic-dipole model, it achieves high localization accuracy ($1.54\pm1.03$ mm, $2.24\pm1.84^{\circ}$) with low latency ($0.91$ ms on GPU) and shows robustness against calibration requirements that hinder LM-based and other data-driven approaches. The approach promises rapid deployment across clinical settings by eliminating hardware calibration and real-world data collection, while maintaining competitive or superior performance relative to state-of-the-art methods.
Abstract
Permanent magnet tracking using the external sensor array is crucial for the accurate localization of wireless capsule endoscope robots. Traditional tracking algorithms, based on the magnetic dipole model and Levenberg-Marquardt (LM) algorithm, face challenges related to computational delays and the need for initial position estimation. More recently proposed neural network-based approaches often require extensive hardware calibration and real-world data collection, which are time-consuming and labor-intensive. To address these challenges, we propose MobilePosenet, a lightweight neural network architecture that leverages depthwise separable convolutions to minimize computational cost and a channel attention mechanism to enhance localization accuracy. Besides, the inputs to the network integrate the sensors' coordinate information and random noise, compensating for the discrepancies between the theoretical model and the actual magnetic fields and thus allowing MobilePosenet to be trained entirely on theoretical data. Experimental evaluations conducted in a \(90 \times 90 \times 80\) mm workspace demonstrate that MobilePosenet exhibits excellent 5-DOF localization accuracy ($1.54 \pm 1.03$ mm and $2.24 \pm 1.84^{\circ}$) and inference speed (0.9 ms) against state-of-the-art methods trained on real-world data. Since network training relies solely on theoretical data, MobilePosenet can eliminate the hardware calibration and real-world data collection process, improving the generalizability of this permanent magnet localization method and the potential for rapid adoption in different clinical settings.
