MCGrad: Multicalibration at Web Scale

Niek Tax; Lorenzo Perini; Fridolin Linder; Daniel Haimovich; Dima Karamshuk; Nastaran Okati; Milan Vojnovic; Pavlos Athanasios Apostolopoulos

TL;DR

MCGrad targets multicalibration at web scale. It removes the need to manually specify protected groups and instead runs multiple rounds of gradient boosting on an augmented input that includes the base predictor’s outputs. Training is efficient, and a logit-rescaling step maintains calibration without harming log loss or Area Under the Precision-Recall Curve (PRAUC); early stopping and Hessian-based leaf regularization guard against overfitting. Empirically, MCGrad improves multicalibration on public benchmarks and delivers substantial, consistent gains in production at Meta across hundreds of models, with latency suitable for online deployment. The work also offers practical guidance for industry adoption and connects multicalibration to broader performance metrics, underscoring the real-world value of calibrated, subgroup-aware predictions.
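The round structure described above can be illustrated with a small, dependency-free sketch. It is not the MCGrad implementation or its API: a single depth-1 tree per round stands in for a full gradient-boosted model, the scale search is a plain grid, and every name and default here (`multiround_sketch`, `fit_stump`, `SCALES`, `lam`, `lr`) is a hypothetical choice made for illustration. The synthetic base predictor deliberately ignores a binary group feature, so it is miscalibrated within each group.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def log_loss(logits, labels):
    """Mean binary cross-entropy computed directly from logits."""
    return sum(max(z, 0.0) - y * z + math.log1p(math.exp(-abs(z)))
               for z, y in zip(logits, labels)) / len(labels)

def fit_stump(rows, grads, hess, lam=1.0):
    """Depth-1 tree with Newton leaf values G / (H + lam).
    lam acts as a Hessian-based leaf regularizer (assumed default)."""
    best = None
    for j in range(len(rows[0])):
        values = sorted(set(r[j] for r in rows))
        for t in values[::max(1, len(values) // 16)]:  # coarse threshold grid
            gl = hl = gr = hr = 0.0
            for r, g, h in zip(rows, grads, hess):
                if r[j] <= t:
                    gl, hl = gl + g, hl + h
                else:
                    gr, hr = gr + g, hr + h
            gain = gl * gl / (hl + lam) + gr * gr / (hr + lam)
            if best is None or gain > best[0]:
                best = (gain, j, t, gl / (hl + lam), gr / (hr + lam))
    _, j, t, left, right = best
    return lambda r: left if r[j] <= t else right

SCALES = [0.5 + 0.05 * k for k in range(31)]  # candidate logit multipliers

def multiround_sketch(X_tr, y_tr, f_tr, X_va, y_va, f_va, max_rounds=20, lr=0.5):
    f_tr, f_va = list(f_tr), list(f_va)
    best_val, rounds = log_loss(f_va, y_va), 0
    for _ in range(max_rounds):
        # Augment the features with the current predictor's logit.
        rows_tr = [x + [f] for x, f in zip(X_tr, f_tr)]
        rows_va = [x + [f] for x, f in zip(X_va, f_va)]
        probs = [sigmoid(f) for f in f_tr]
        grads = [y - p for y, p in zip(y_tr, probs)]   # negative log-loss gradient
        hess = [p * (1.0 - p) for p in probs]
        stump = fit_stump(rows_tr, grads, hess)
        new_tr = [f + lr * stump(r) for f, r in zip(f_tr, rows_tr)]
        new_va = [f + lr * stump(r) for f, r in zip(f_va, rows_va)]
        # Rescale all logits by the scalar that minimizes training log loss.
        c = min(SCALES, key=lambda s: log_loss([s * f for f in new_tr], y_tr))
        new_tr = [c * f for f in new_tr]
        new_va = [c * f for f in new_va]
        val = log_loss(new_va, y_va)
        if val >= best_val:  # early stopping: reject a round that does not help
            break
        f_tr, f_va, best_val, rounds = new_tr, new_va, val, rounds + 1
    return f_va, rounds

def make_data(n, rng):
    X, y, base = [], [], []
    for _ in range(n):
        group, u = float(rng.random() < 0.5), rng.random()
        true_logit = 2.0 * u - 1.0 + (1.0 if group else -1.0)
        X.append([group, u])
        y.append(1.0 if rng.random() < sigmoid(true_logit) else 0.0)
        base.append(2.0 * u - 1.0)  # base predictor ignores the group
    return X, y, base

rng = random.Random(0)
X_tr, y_tr, base_tr = make_data(400, rng)
X_va, y_va, base_va = make_data(200, rng)
final_va, rounds_used = multiround_sketch(X_tr, y_tr, base_tr, X_va, y_va, base_va)
print(rounds_used, log_loss(base_va, y_va), log_loss(final_va, y_va))
```

Because a round is kept only when it lowers validation log loss, the returned logits can never be worse on the validation set than the base predictor's. No subgroup is named anywhere in the loop: the weak learner is free to split on any feature, including the current logit.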

Abstract

We propose MCGrad, a novel and scalable multicalibration algorithm. Multicalibration, meaning calibration in subgroups of the data, is an important property for the performance of machine learning-based systems. Existing multicalibration methods have thus far received limited traction in industry. We argue that this is because existing methods (1) require such subgroups to be manually specified, which ML practitioners often struggle with, (2) are not scalable, or (3) may harm other notions of model performance such as log loss and Area Under the Precision-Recall Curve (PRAUC). MCGrad does not require explicit specification of protected groups, is scalable, and often improves other ML evaluation metrics instead of harming them. MCGrad is in production at Meta, where it is now part of hundreds of production models. We present results from these deployments as well as results on public datasets. We provide an open source implementation of MCGrad at https://github.com/facebookincubator/MCGrad.
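To make "calibration in subgroups" concrete, the sketch below compares mean predicted probability with the observed positive rate inside each subgroup. It is a simplified diagnostic written for illustration, not the multicalibration metric used in the paper; the function name `subgroup_calibration_gaps` and the toy data are assumptions.

```python
from collections import defaultdict

def subgroup_calibration_gaps(probs, labels, groups):
    """Return |mean(prediction) - mean(label)| for each subgroup."""
    sums = defaultdict(lambda: [0.0, 0.0, 0])  # [sum of probs, sum of labels, count]
    for p, y, g in zip(probs, labels, groups):
        s = sums[g]
        s[0] += p
        s[1] += y
        s[2] += 1
    return {g: abs(sp - sy) / n for g, (sp, sy, n) in sums.items()}

# A predictor that always outputs 0.5 on a population with a 50% positive rate.
probs = [0.5] * 8
labels = [1, 1, 1, 0, 0, 0, 0, 1]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

overall_gap = subgroup_calibration_gaps(probs, labels, ["all"] * 8)["all"]
gaps = subgroup_calibration_gaps(probs, labels, groups)
print(overall_gap, gaps)  # 0.0 {'a': 0.25, 'b': 0.25}
```

The predictor looks perfectly calibrated on the whole population, yet it underpredicts by 0.25 in group `a` and overpredicts by 0.25 in group `b`. Gaps of this kind are what multicalibration aims to remove.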
