TRACE: Transformer-based Risk Assessment for Clinical Evaluation
Dionysis Christopoulos, Sotiris Spanos, Valsamis Ntouskos, Konstantinos Karantzalos
TL;DR
TRACE addresses clinical risk assessment with highly multi-modal, often incomplete tabular data by introducing a Transformer-based architecture that unifies continuous, categorical, and checkbox features into a single embedding space processed by a Transformer encoder. A strong non-negative MLP baseline (nnMLP) is proposed for comparison, highlighting interpretability through monotonic risk contributions. TRACE explicitly handles missing values and provides attention-based explanations via feature maps, achieving competitive or state-of-the-art performance across six diverse clinical datasets with favorable parameter efficiency. The work demonstrates robust handling of class imbalance through focal loss and ablation studies confirm the value of checkbox embeddings and the proposed data representations. Overall, TRACE offers practical, interpretable, and efficient risk assessment suitable for clinical decision support, with publicly available code for reproducibility.
Abstract
We present TRACE (Transformer-based Risk Assessment for Clinical Evaluation), a novel method for clinical risk assessment based on clinical data, leveraging the self-attention mechanism for enhanced feature interaction and result interpretation. Our approach is able to handle different data modalities, including continuous, categorical and multiple-choice (checkbox) attributes. The proposed architecture features a shared representation of the clinical data obtained by integrating specialized embeddings of each data modality, enabling the detection of high-risk individuals using Transformer encoder layers. To assess the effectiveness of the proposed method, a strong baseline based on non-negative multi-layer perceptrons (MLPs) is introduced. The proposed method outperforms various baselines widely used in the domain of clinical risk assessment, while effectively handling missing values. In terms of explainability, our Transformer-based method offers easily interpretable results via attention weights, further enhancing the clinicians' decision-making process.
