Preoperative Prognosis Assessment of Lumbar Spinal Surgery for Low Back Pain and Sciatica Patients based on Multimodalities and Multimodal Learning
Li-Chin Chen, Jung-Nien Lai, Hung-En Lin, Hsien-Te Chen, Kuo-Hsuan Hung, Yu Tsao
TL;DR
The study tackles the challenge of predicting preoperative outcomes for lumbar spinal surgery in LBP and sciatica by integrating multimodal data—standard clinical assessments, Traditional Chinese Medicine body constitution, planned surgical approach, and vowel pronunciation—within a multimodal learning framework. It systematically compares uni- and multi-modal configurations across early, joint, and late fusion strategies, using tabular data, free text, and acoustic features processed by CNNs, PubMedBERT-based NLP, and LSTM encoders, with CatBoost and other tree methods for unimodal tabular data. The key finding is that a combination of tabular data, text, and surgical plan information achieves the best accuracy (0.81) and AUROC (0.83) in a 105-patient cohort, with joint fusion offering robust performance and interpretable signals via IG and CCA analyses. The work demonstrates a practical, light-weight preoperative prognosis tool and provides actionable insights into modality contributions and fusion strategies for clinical decision support, bridging Eastern medicine concepts with modern ML techniques. Future work should scale sample size, refine audio features, and further validate interpretability in clinical settings.
Abstract
Low back pain (LBP) and sciatica may require surgical therapy when they are symptomatic of severe pain. However, there is no effective measures to evaluate the surgical outcomes in advance. This work combined elements of Eastern medicine and machine learning, and developed a preoperative assessment tool to predict the prognosis of lumbar spinal surgery in LBP and sciatica patients. Standard operative assessments, traditional Chinese medicine body constitution assessments, planned surgical approach, and vowel pronunciation recordings were collected and stored in different modalities. Our work provides insights into leveraging modality combinations, multimodals, and fusion strategies. The interpretability of models and correlations between modalities were also inspected. Based on the recruited 105 patients, we found that combining standard operative assessments, body constitution assessments, and planned surgical approach achieved the best performance in 0.81 accuracy. Our approach is effective and can be widely applied in general practice due to simplicity and effective.
