Advanced Deep Learning and Large Language Models: Comprehensive Insights for Cancer Detection

Yassine Habchi; Hamza Kheddar; Yassine Himeur; Adel Belouchrani; Erchin Serpedin; Fouad Khelifi; Muhammad E. H. Chowdhury

Advanced Deep Learning and Large Language Models: Comprehensive Insights for Cancer Detection

Yassine Habchi, Hamza Kheddar, Yassine Himeur, Adel Belouchrani, Erchin Serpedin, Fouad Khelifi, Muhammad E. H. Chowdhury

TL;DR

This paper surveys how advanced DL methods—reinforcement learning, federated learning, transfer learning, transformers, and large language models—address cancer detection across imaging, pathology, and omics data. It highlights how RL optimizes diagnostic pathways, FL preserves privacy in multi-institution settings, TL mitigates data scarcity, and Transformers/LLMs enable multimodal and text-rich analysis. Key contributions include mapping techniques to cancer-detection tasks, outlining evaluation metrics and datasets, and identifying challenges such as data imbalance and interpretability with proposed remedies like data augmentation and RAG for LLMs. The work provides a comprehensive resource for researchers and clinicians to adopt and adapt these techniques for improved detection accuracy, generalization, and clinical decision support.

Abstract

The rapid advancement of deep learning (DL) has transformed healthcare, particularly in cancer detection and diagnosis. DL surpasses traditional machine learning and human accuracy, making it a critical tool for identifying diseases. Despite numerous reviews on DL in healthcare, a comprehensive analysis of its role in cancer detection remains limited. Existing studies focus on specific aspects, leaving gaps in understanding its broader impact. This paper addresses these gaps by reviewing advanced DL techniques, including transfer learning (TL), reinforcement learning (RL), federated learning (FL), Transformers, and large language models (LLMs). These approaches enhance accuracy, tackle data scarcity, and enable decentralized learning while maintaining data privacy. TL adapts pre-trained models to new datasets, improving performance with limited labeled data. RL optimizes diagnostic pathways and treatment strategies, while FL fosters collaborative model development without sharing sensitive data. Transformers and LLMs, traditionally used in natural language processing, are now applied to medical data for improved interpretability. Additionally, this review examines these techniques' efficiency in cancer diagnosis, addresses challenges like data imbalance, and proposes solutions. It serves as a resource for researchers and practitioners, providing insights into current trends and guiding future research in advanced DL for cancer detection.

Advanced Deep Learning and Large Language Models: Comprehensive Insights for Cancer Detection

TL;DR

Abstract

Advanced Deep Learning and Large Language Models: Comprehensive Insights for Cancer Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)