Practical machine learning is learning on small samples

Marina Sapir

Practical machine learning is learning on small samples

Marina Sapir

TL;DR

This work reframes machine learning from a statistical, asymptotic paradigm to a practical, logic-grounded one. It argues that real-world learning operates under implicit smoothness and finite data, and introduces the Practical learning paradigm built on baseline cases, counterparts, and inconsistency measures. By showing that common algorithms like ERM, k-NN, decision trees, Naive Bayes, and linear SVM/SVR can be interpreted as practical learners minimizing total inconsistency, the paper justifies a unifying framework grounded in abduction and oscillation-minimizing criteria. The approach enables meaningful comparisons, robust testing, and handling of outliers and data-scarce scenarios, with broad implications for practice and future extensions to other ML problems.

Abstract

Based on limited observations, machine learning discerns a dependence which is expected to hold in the future. What makes it possible? Statistical learning theory imagines indefinitely increasing training sample to justify its approach. In reality, there is no infinite time or even infinite general population for learning. Here I argue that practical machine learning is based on an implicit assumption that underlying dependence is relatively ``smooth" : likely, there are no abrupt differences in feedback between cases with close data points. From this point of view learning shall involve selection of the hypothesis ``smoothly" approximating the training set. I formalize this as Practical learning paradigm. The paradigm includes terminology and rules for description of learners. Popular learners (local smoothing, k-NN, decision trees, Naive Bayes, SVM for classification and for regression) are shown here to be implementations of this paradigm.

Practical machine learning is learning on small samples

TL;DR

Abstract

Paper Structure (20 sections, 2 theorems, 46 equations)

This paper contains 20 sections, 2 theorems, 46 equations.

Givens, goals and assumptions
Traditional views on ML
Is ML an induction?
Is ML a prediction problem?
Does Statistical Learning describe ML?
Practical learning. Main concepts
Definitions
Intended meaning
Local smoothing is a practical learner
k-NN is a practical learner
Decision tree is a practical learner
Naive Bayes is a practical learner
Linear SVM for classification is a practical learner
Linear Support vector regression is a practical learner
Advantages of Practical learning theory
...and 5 more sections

Key Result

Lemma 1

The problems Linear SVM* and Linear SVM are equivalent.

Theorems & Definitions (9)

Definition 1
Definition 2
Definition 3
proof
proof
Lemma 1
proof
Theorem 1
proof

Practical machine learning is learning on small samples

TL;DR

Abstract

Practical machine learning is learning on small samples

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (9)