Layer Separation: Adjustable Joint Space Width Images Synthesis in Conventional Radiography
Haolin Wang, Yafei Ou, Prasoon Ambalathankandy, Gen Ota, Pengyu Dai, Masayuki Ikebe, Kenji Suzuki, Tamotsu Kamishima
TL;DR
The paper addresses data quality and annotation bottlenecks in RA JSW analysis by introducing Layer Separation Networks (LSN) to separate soft tissue and bone layers in finger radiographs and synthesize adjustable JSW images with ground-truth annotations. LSN combines a generation network, segmentation supervision, soft-tissue discrimination, random shifting, and radiography-consistent reconstruction, enabling two-stage training with pseudo-images and yielding layer images $L=igl\{L_0,L_1,L_2\bigr\}$. The approach achieves realistic reconstructions, robust layer separation even with overlaps, and improves downstream tasks (JSN progress, JSW quantification, SvdH-like scoring) through synthetic-data pre-training, while maintaining clinical plausibility validated by a Visual Turing Test. The work demonstrates that synthetic data can address imbalanced JSW distributions and annotation scarcity, potentially accelerating RA-CAD development and achieving more robust disease monitoring. Code and dataset availability are anticipated, amplifying practical impact for clinical radiology and machine learning research.
Abstract
Rheumatoid arthritis (RA) is a chronic autoimmune disease characterized by joint inflammation and progressive structural damage. Joint space width (JSW) is a critical indicator in conventional radiography for evaluating disease progression, which has become a prominent research topic in computer-aided diagnostic (CAD) systems. However, deep learning-based radiological CAD systems for JSW analysis face significant challenges in data quality, including data imbalance, limited variety, and annotation difficulties. This work introduced a challenging image synthesis scenario and proposed Layer Separation Networks (LSN) to accurately separate the soft tissue layer, the upper bone layer, and the lower bone layer in conventional radiographs of finger joints. Using these layers, the adjustable JSW images can be synthesized to address data quality challenges and achieve ground truth (GT) generation. Experimental results demonstrated that LSN-based synthetic images closely resemble real radiographs, and significantly enhanced the performance in downstream tasks. The code and dataset will be available.
