Magical: Medical Lay Language Generation via Semantic Invariance and Layperson-tailored Adaptation
Weibin Liao, Tianlong Wang, Yinghao Zhu, Yasha Wang, Junyi Gao, Liantao Ma
TL;DR
Medical Lay Language Generation (MLLG) faces semantic fidelity and diverse lay-style generation challenges when fine-tuning on heterogeneous datasets. Magical introduces an asymmetric LoRA framework with a shared matrix $A$ for abstractive summarization and multiple per-style matrices $B$ to capture diverse lay styles, augmented by a Semantic Invariance Constraint on $A$ and a Recommendation-guided Switch to select the appropriate $B$. Empirical results across three real-world datasets and multiple backbones show that Magical consistently surpasses prompt-based methods and standard LoRA variants, while reducing trainable parameters by about 31.66%. The approach demonstrates strong semantic alignment and improved lay-language quality, with ablations confirming the importance of semantic constraints and multi-style adaptation. These findings suggest a modular, semantically-aware, style-aware fine-tuning paradigm that can generalize across domains and support more accessible medical narratives.
Abstract
Medical Lay Language Generation (MLLG) plays a vital role in improving the accessibility of complex scientific content for broader audiences. Recent literature to MLLG commonly employ parameter-efficient fine-tuning methods such as Low-Rank Adaptation (LoRA) to fine-tuning large language models (LLMs) using paired expert-lay language datasets. However, LoRA struggles with the challenges posed by multi-source heterogeneous MLLG datasets. Specifically, through a series of exploratory experiments, we reveal that standard LoRA fail to meet the requirement for semantic fidelity and diverse lay-style generation in MLLG task. To address these limitations, we propose Magical, an asymmetric LoRA architecture tailored for MLLG under heterogeneous data scenarios. Magical employs a shared matrix $A$ for abstractive summarization, along with multiple isolated matrices $B$ for diverse lay-style generation. To preserve semantic fidelity during the lay language generation process, Magical introduces a Semantic Invariance Constraint to mitigate semantic subspace shifts on matrix $A$. Furthermore, to better adapt to diverse lay-style generation, Magical incorporates the Recommendation-guided Switch, an externally interface to prompt the LLM to switch between different matrices $B$. Experimental results on three real-world lay language generation datasets demonstrate that Magical consistently outperforms prompt-based methods, vanilla LoRA, and its recent variants, while also reducing trainable parameters by 31.66%. Our code is publicly available at https://github.com/tianlwang/Magical.git.
