SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms
Yifei Chen, Zhu Zhu, Shenghao Zhu, Linwei Qiu, Binfeng Zou, Fan Jia, Yunpeng Zhu, Chenyan Zhang, Zhaojie Fang, Feiwei Qin, Jin Fan, Changmiao Wang, Yu Gao, Gang Yu
TL;DR
Bone marrow cell classification is essential for diagnosing hematologic diseases but is hampered by high-dimensional micrograph data, long-tail class distributions, and subtle inter-class differences. The authors introduce SCKans transformer, which fuses a Kansformer Encoder (replacing MLP with Kolmogorov-Arnold networks), an SCConv Encoder (spatial and channel redundancy reduction), and a Global-Local Attention Encoder (global self-attention plus local feature extraction) to achieve robust fine-grained classification. The model is validated on a private BMCD-FGCD dataset (>10k samples, ~40 classes) and public BM datasets (PBC, ALL-IDB), outperforming ViT, EfficientNetV2, and specialized WBC models across accuracy, precision, recall, F1, and MCC; ablation studies confirm the necessity of each component. Additionally, the BMCD-FGCD dataset is released to the research community, underscoring the method’s practical impact for hematology and automated diagnostics and setting the stage for broader clinical deployment in bone marrow cytomorphology.
Abstract
The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redundant feature extraction when processing high-dimensional microimage data. We propose a novel fine-grained classification model, SCKansformer, for bone marrow blood cells, which addresses these challenges and enhances classification accuracy and efficiency. The model integrates the Kansformer Encoder, SCConv Encoder, and Global-Local Attention Encoder. The Kansformer Encoder replaces the traditional MLP layer with the KAN, improving nonlinear feature representation and interpretability. The SCConv Encoder, with its Spatial and Channel Reconstruction Units, enhances feature representation and reduces redundancy. The Global-Local Attention Encoder combines Multi-head Self-Attention with a Local Part module to capture both global and local features. We validated our model using the Bone Marrow Blood Cell Fine-Grained Classification Dataset (BMCD-FGCD), comprising over 10,000 samples and nearly 40 classifications, developed with a partner hospital. Comparative experiments on our private dataset, as well as the publicly available PBC and ALL-IDB datasets, demonstrate that SCKansformer outperforms both typical and advanced microcell classification methods across all datasets. Our source code and private BMCD-FGCD dataset are available at https://github.com/JustlfC03/SCKansformer.
