PRISM-Consult: A Panel-of-Experts Architecture for Clinician-Aligned Diagnosis
Lionel Levine, John Santerre, Alexander S. Young, T. Barry Levine, Francis Campion, Majid Sarrafzadeh
TL;DR
PRISM-Consult extends a compact ED decision-support backbone to a routed panel of domain specialists, dispatching episodes to Cardiac-Vascular, Pulmonary, Gastro-Oesophageal, Musculoskeletal, and Psychogenic experts based on early token signals. The approach preserves a shared tokenization template and embeddings, while training lightweight domain adapters (LoRA) and using a calibrated router to balance safety, coverage, and compute savings versus consult-all. Across five domains, specialists converge with low development perplexities near 2.0 and the router achieves high life-threat recall with substantial reductions in consulted experts, enabling auditable, real-time consult at scale. The work outlines data, calibration, and validation steps to support prospective deployment, and discusses plans for external validation, thresholding refinements, and expanded domain coverage to enhance clinical safety and scalability.
Abstract
We present PRISM-Consult, a clinician-aligned panel-of-experts architecture that extends the compact PRISM sequence model into a routed family of domain specialists. Episodes are tokenized as structured clinical events; a light-weight router reads the first few tokens and dispatches to specialist models (Cardiac-Vascular, Pulmonary, Gastro-Oesophageal, Musculoskeletal, Psychogenic). Each specialist inherits PRISM's small transformer backbone and token template, enabling parameter efficiency and interpretability. On real-world Emergency Department cohorts, specialists exhibit smooth convergence with low development perplexities across domains, while the router achieves high routing quality and large compute savings versus consult-all under a safety-first policy. We detail the data methodology (initial vs. conclusive ICD-9 families), routing thresholds and calibration, and report per-domain results to avoid dominance by common events. The framework provides a practical path to safe, auditable, and low-latency consult at scale, and we outline validation steps-external/temporal replication, asymmetric life-threat thresholds, and multi-label arbitration-to meet prospective clinical deployment standards.
