Machine Learning Class Numbers of Real Quadratic Fields
Malik Amir, Yang-Hui He, Kyu-Hwan Lee, Thomas Oliver, Eldar Sultanow
TL;DR
This work investigates the problem of distinguishing real quadratic fields by class number using finitely many Dedekind zeta-coefficient features. It formalizes genus-theoretic constraints, introduces counting-function-based cost measures, and a bubble algorithm to quantify separability between class numbers, while employing PCA for dimensionality insights. For the {1,2} case, gradient-boosting models and symbolic classification recover genus-theory parities and yield explicit approximate class-number formulas. For the {1,3} case, zeta-coefficients alone are insufficient, but combining discriminants, regulators, and partial zeta-sums with symbolic methods yields high-accuracy predictors and explicit formulas, highlighting the differing data-and-feature requirements across class-number pairs and linking ML results to classical number-theoretic invariants.
Abstract
We implement and interpret various supervised learning experiments involving real quadratic fields with class numbers 1, 2 and 3. We quantify the relative difficulties in separating class numbers of matching/different parity from a data-scientific perspective, apply the methodology of feature analysis and principal component analysis, and use symbolic classification to develop machine-learned formulas for class numbers 1, 2 and 3 that apply to our dataset.
