The Return of Pseudosciences in Artificial Intelligence: Have Machine Learning and Deep Learning Forgotten Lessons from Statistics and History?
Jérémie Sublime
TL;DR
This paper argues that modern ML/DL systems frequently misinterpret correlations as causation, effectively reviving pseudosciences under a polished AI veneer. It surveys high-stakes applications in justice, security, and sociology to show how high performance metrics can obscure real harms, particularly via false positives. It critiques the reliance on theory-free, data-driven approaches and biased datasets, advocating for harm-focused metrics and continuous human oversight. By drawing on statistical history, the authors call for rethinking AI models, evaluation criteria, and domain-aligned ethics training to prevent discriminatory or dangerous outcomes.
Abstract
In today's world, AI programs powered by Machine Learning are ubiquitous, and have achieved seemingly exceptional performance across a broad range of tasks, from medical diagnosis and credit rating in banking, to theft detection via video analysis, and even predicting political or sexual orientation from facial images. These predominantly deep learning methods excel due to their extraordinary capacity to process vast amounts of complex data to extract complex correlations and relationship from different levels of features. In this paper, we contend that the designers and final users of these ML methods have forgotten a fundamental lesson from statistics: correlation does not imply causation. Not only do most state-of-the-art methods neglect this crucial principle, but by doing so they often produce nonsensical or flawed causal models, akin to social astrology or physiognomy. Consequently, we argue that current efforts to make AI models more ethical by merely reducing biases in the training data are insufficient. Through examples, we will demonstrate that the potential for harm posed by these methods can only be mitigated by a complete rethinking of their core models, improved quality assessment metrics and policies, and by maintaining humans oversight throughout the process.
