Towards Human-Centered Early Prediction Models for Academic Performance in Real-World Contexts
Han Zhang, Yiyi Ren, Paula S. Nurius, Jennifer Mankoff, Anind K. Dey
TL;DR
This paper tackles the problem of predicting academic performance early in real-world settings by integrating three approaches—Logistic Regression (LR), 1D-CNN, and MTL-1D-CNN—trained on passive behavioral data and self-reports collected within the first week of Spring terms. It emphasizes HCML principles (explainability, fairness, generalizability) and demonstrates that predictions can be made by Week 1 with a GPA threshold of $3.2$, while highlighting trade-offs among the principles. LR and 1D-CNN achieve high early accuracy and reveal interpretable predictors, though deep learning approaches struggle with fairness and generalizability; MTL-1D-CNN offers stronger cross-year robustness but shows fairness and explainability challenges. The findings underscore socio-technical challenges in deploying such systems and advocate for human-centered, privacy-preserving, and governance-aware integration into multi-stakeholder student support workflows to enable timely, equitable interventions.
Abstract
Supporting student success requires collaboration among multiple stakeholders. Researchers have explored machine learning models for academic performance prediction; yet key challenges remain in ensuring these models are interpretable, equitable, and actionable within real-world educational support systems. First, many models prioritize predictive accuracy but overlook human-centered machine learning principles, limiting trust among students and reducing their usefulness for educators and institutional decision-makers. Second, most models require at least a month of data before making reliable predictions, delaying opportunities for early intervention. Third, current models primarily rely on sporadically collected, classroom-derived data, missing broader behavioral patterns that could provide more continuous and actionable insights. To address these gaps, we present three modeling approaches-LR, 1D-CNN, and MTL-1D-CNN-to classify students as low or high academic performers. We evaluate them based on explainability, fairness, and generalizability to assess their alignment with key social values. Using behavioral and self-reported data collected within the first week of two Spring terms, we demonstrate that these models can identify at-risk students as early as week one. However, trade-offs across human-centered machine learning principles highlight the complexity of designing predictive models that effectively support multi-stakeholder decision-making and intervention strategies. We discuss these trade-offs and their implications for different stakeholders, outlining how predictive models can be integrated into student support systems. Finally, we examine broader socio-technical challenges in deploying these models and propose future directions for advancing human-centered, collaborative academic prediction systems.
