Filtering instances and rejecting predictions to obtain reliable models in healthcare

Maria Gabriela Valeriano; David Kohan Marzagão; Alfredo Montelongo; Carlos Roberto Veiga Kiffer; Natan Katz; Ana Carolina Lorena

Filtering instances and rejecting predictions to obtain reliable models in healthcare

Maria Gabriela Valeriano, David Kohan Marzagão, Alfredo Montelongo, Carlos Roberto Veiga Kiffer, Natan Katz, Ana Carolina Lorena

TL;DR

The paper tackles reliability in healthcare ML by pairing data-centric data refinement with a safety-oriented rejection mechanism. It introduces Instance Hardness (IH) as a consensus-based metric to prune hard training examples and couples this with a calibrated confidence-based reject option at inference, guided by a cost function balancing accuracy, confidence, and coverage. Key contributions include a detailed framework and a heuristic for threshold selection, evaluation on three real-world Brazilian health datasets, and baseline comparisons using influence values and uncertainty-based rejection. The results show that IH filtering together with confidence-based rejection improves predictive reliability while preserving a large share of predictions, supporting practical deployment in safety-critical settings. The framework is adaptable, data-centric, and emphasizes transparent, risk-aware decision-making for real-world healthcare applications.

Abstract

Machine Learning (ML) models are widely used in high-stakes domains such as healthcare, where the reliability of predictions is critical. However, these models often fail to account for uncertainty, providing predictions even with low confidence. This work proposes a novel two-step data-centric approach to enhance the performance of ML models by improving data quality and filtering low-confidence predictions. The first step involves leveraging Instance Hardness (IH) to filter problematic instances during training, thereby refining the dataset. The second step introduces a confidence-based rejection mechanism during inference, ensuring that only reliable predictions are retained. We evaluate our approach using three real-world healthcare datasets, demonstrating its effectiveness at improving model reliability while balancing predictive performance and rejection rate. Additionally, we use alternative criteria - influence values for filtering and uncertainty for rejection - as baselines to evaluate the efficiency of the proposed method. The results demonstrate that integrating IH filtering with confidence-based rejection effectively enhances model performance while preserving a large proportion of instances. This approach provides a practical method for deploying ML systems in safety-critical applications.

Filtering instances and rejecting predictions to obtain reliable models in healthcare

TL;DR

Abstract

Filtering instances and rejecting predictions to obtain reliable models in healthcare

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)