Table of Contents
Fetching ...

The Rise of Small Language Models in Healthcare: A Comprehensive Survey

Muskan Garg, Shaina Raza, Shebuti Rayana, Xingyi Liu, Sunghwan Sohn

TL;DR

The paper tackles the challenge of deploying AI in healthcare under privacy, latency, and resource constraints by surveying small language models (SLMs) and proposing a comprehensive taxonomy. It covers architectural foundations (data, tokenization, attention), adaptation strategies (prompting, in-context learning, fine-tuning), and compression techniques (distillation, pruning, quantization), along with current-state benchmarks and safety assessments. Key contributions include a proposed taxonomy for tuning and compression, a timeline of healthcare SLMs, a curated open-resource inventory, and sustainability metrics such as carbon footprints to guide practical deployment. The work highlights the potential for on-device, privacy-preserving CDSS and clinician-friendly AI tools, while outlining critical challenges—benchmarking, data access, bias, and regulatory alignment—that must be addressed to realize real-world impact.

Abstract

Despite substantial progress in healthcare applications driven by large language models (LLMs), growing concerns around data privacy, and limited resources; the small language models (SLMs) offer a scalable and clinically viable solution for efficient performance in resource-constrained environments for next-generation healthcare informatics. Our comprehensive survey presents a taxonomic framework to identify and categorize them for healthcare professionals and informaticians. The timeline of healthcare SLM contributions establishes a foundational framework for analyzing models across three dimensions: NLP tasks, stakeholder roles, and the continuum of care. We present a taxonomic framework to identify the architectural foundations for building models from scratch; adapting SLMs to clinical precision through prompting, instruction fine-tuning, and reasoning; and accessibility and sustainability through compression techniques. Our primary objective is to offer a comprehensive survey for healthcare professionals, introducing recent innovations in model optimization and equipping them with curated resources to support future research and development in the field. Aiming to showcase the groundbreaking advancements in SLMs for healthcare, we present a comprehensive compilation of experimental results across widely studied NLP tasks in healthcare to highlight the transformative potential of SLMs in healthcare. The updated repository is available at Github

The Rise of Small Language Models in Healthcare: A Comprehensive Survey

TL;DR

The paper tackles the challenge of deploying AI in healthcare under privacy, latency, and resource constraints by surveying small language models (SLMs) and proposing a comprehensive taxonomy. It covers architectural foundations (data, tokenization, attention), adaptation strategies (prompting, in-context learning, fine-tuning), and compression techniques (distillation, pruning, quantization), along with current-state benchmarks and safety assessments. Key contributions include a proposed taxonomy for tuning and compression, a timeline of healthcare SLMs, a curated open-resource inventory, and sustainability metrics such as carbon footprints to guide practical deployment. The work highlights the potential for on-device, privacy-preserving CDSS and clinician-friendly AI tools, while outlining critical challenges—benchmarking, data access, bias, and regulatory alignment—that must be addressed to realize real-world impact.

Abstract

Despite substantial progress in healthcare applications driven by large language models (LLMs), growing concerns around data privacy, and limited resources; the small language models (SLMs) offer a scalable and clinically viable solution for efficient performance in resource-constrained environments for next-generation healthcare informatics. Our comprehensive survey presents a taxonomic framework to identify and categorize them for healthcare professionals and informaticians. The timeline of healthcare SLM contributions establishes a foundational framework for analyzing models across three dimensions: NLP tasks, stakeholder roles, and the continuum of care. We present a taxonomic framework to identify the architectural foundations for building models from scratch; adapting SLMs to clinical precision through prompting, instruction fine-tuning, and reasoning; and accessibility and sustainability through compression techniques. Our primary objective is to offer a comprehensive survey for healthcare professionals, introducing recent innovations in model optimization and equipping them with curated resources to support future research and development in the field. Aiming to showcase the groundbreaking advancements in SLMs for healthcare, we present a comprehensive compilation of experimental results across widely studied NLP tasks in healthcare to highlight the transformative potential of SLMs in healthcare. The updated repository is available at Github

Paper Structure

This paper contains 25 sections, 5 figures, 7 tables.

Figures (5)

  • Figure 1: Opening remarks of the study rationale: (a) Defined scope of the study to investigate the NLP-centered SLM development for healthcare professionals; (b) Monthly downloads for 7B and 13B parameters model from Huggingface for both MentaLLAMA and MedAlpaca.
  • Figure 2: Taxonomic overview of the study design for the rise of SLMs in healthcare.
  • Figure 3: Timeline of SLM development in healthcare. Transforming NLP-driven clinical decision-making, NLP tasks for clinical conditions, and doctor-patient conversation or mental health analysis.
  • Figure 4: Compiled performance of healthcare SLMs for the most commonly leveraged datasets in healthcare-specific NLP tasks.
  • Figure 5: The role of SLM in healthcare