Toward Cost-efficient Adaptive Clinical Trials in Knee Osteoarthritis with Reinforcement Learning
Khanh Nguyen, Huy Hoang Nguyen, Egor Panfilov, Aleksei Tiulpin
TL;DR
This work tackles cost-efficient data collection in adaptive KOA trials by framing Active Sensing as an RL problem. The authors design a Q-network–based agent that chooses follow-up versus skip, using a multimodal, patient-level state that fuses imaging biomarkers (notably fJSW-derived measures) with clinical data across both knees. A novel reward function ties together follow-up costs, data utility, and progression timing to produce an economically favorable policy, with ablations guiding parameter choices. Trained on the OAI dataset, the RL method outperforms baselines in BA and recall and achieves positive reward per person (RPP) while reducing follow-up costs, demonstrating potential for more cost-effective KOA trials and data collection in broader clinical settings. The approach is fully automatic at test time and is released publicly to spur adoption and further refinement in adaptive trial design as well as other chronic diseases.
Abstract
Osteoarthritis (OA) is the most common musculoskeletal disease, with knee OA (KOA) being one of the leading causes of disability and a significant economic burden. Predicting KOA progression is crucial for improving patient outcomes, optimizing healthcare resources, studying the disease, and developing new treatments. The latter application particularly requires one to understand the disease progression in order to collect the most informative data at the right time. Existing methods, however, are limited by their static nature and their focus on individual joints, leading to suboptimal predictive performance and downstream utility. Our study proposes a new method that allows to dynamically monitor patients rather than individual joints with KOA using a novel Active Sensing (AS) approach powered by Reinforcement Learning (RL). Our key idea is to directly optimize for the downstream task by training an agent that maximizes informative data collection while minimizing overall costs. Our RL-based method leverages a specially designed reward function to monitor disease progression across multiple body parts, employs multimodal deep learning, and requires no human input during testing. Extensive numerical experiments demonstrate that our approach outperforms current state-of-the-art models, paving the way for the next generation of KOA trials.
