From Images to Detection: Machine Learning for Blood Pattern Classification
Yilin Li, Weining Shen
TL;DR
This paper tackles the problem of distinguishing gunshot backspatter from impact spatter in bloodstain pattern analysis (BPA). It builds an interpretable feature pipeline: ellipse-based extraction of individual stains, followed by region-based summarization that produces pattern-level descriptors. It introduces the Stability Importance Score (SIS) to gauge feature importance robustly across multiple model fits, and demonstrates that XGBoost, complemented by region-aware and shading features, achieves high classification accuracy (roughly $92$–$97\%$, depending on the metric) on a public dataset. The work also compares XGBoost with Random Forest and other classifiers, finding that ensemble methods perform best, and identifies data quality as a key factor in cross-dataset generalization, with only limited transfer to other datasets. Overall, the approach provides a quantitative, scalable framework for BPA decision support and points toward richer feature spaces and more diverse datasets.
Abstract
Bloodstain Pattern Analysis (BPA) helps us understand how bloodstains form, with a focus on their size, shape, and distribution. This aids in crime scene reconstruction and provides insight into victim positions, which in turn supports criminal investigation. One challenge in BPA is distinguishing between different types of bloodstains, such as those from firearms, impacts, or other mechanisms. Our study focuses on differentiating impact spatter bloodstain patterns from gunshot bloodstain patterns. We distinguish patterns by extracting well-designed features of individual stains, applying effective data consolidation methods, and selecting boosting classifiers. As a result, we have developed a model that excels in both accuracy and efficiency. In addition, we use external data sources from previous studies to discuss the challenges and future directions for BPA.
