CellScout: Visual Analytics for Mining Biomarkers in Cell State Discovery
Rui Sheng, Zelin Zang, Jiachen Wang, Yan Luo, Zixin Chen, Yan Zhou, Shaolun Ruan, Huamin Qu
TL;DR
CellScout tackles the co-discovery bottleneck in cell-state biology by jointly mining associations between cell populations and biomarkers using a Mixture-of-Experts (MoE) model and a four-view visual analytics interface. The MoE mining framework optimizes discriminative power and mutual information retention under a cell-representation constraint to produce interpretable association relationships, which are explored via AI Miner, Cell Exploration, Comparison, and Verification views. Validated through expert interviews and a real-world case study, the system demonstrates the ability to reveal novel cell states and robust biomarker candidates, guiding biologists beyond traditional clustering-based approaches. The work contributes a novel MoE-based mining method for biomarker discovery and an interpretable visualization design that supports human-in-the-loop refinement, with promising directions toward multimodal knowledge integration and AI-assisted biomarker validation.
Abstract
Cell state discovery is crucial for understanding biological systems and enhancing medical outcomes. A key aspect of this process is identifying distinct biomarkers that define specific cell states. However, difficulties arise from the co-discovery process of cell states and biomarkers: biologists often use dimensionality reduction to visualize cells in a two-dimensional space. Then they usually interpret visually clustered cells as distinct states, from which they seek to identify unique biomarkers. However, this assumption is often invalid due to internal inconsistencies in a cluster, making the process trial-and-error and highly uncertain. Therefore, biologists urgently need effective tools to help uncover the hidden association relationships between different cell populations and their potential biomarkers. To address this problem, we first designed a machine-learning algorithm based on the Mixture-of-Experts (MoE) technique to identify meaningful associations between cell populations and biomarkers. We further developed a visual analytics system, CellScout, in collaboration with biologists, to help them explore and refine these association relationships to advance cell state discovery. We validated our system through expert interviews, from which we further selected a representative case to demonstrate its effectiveness in discovering new cell states.
