
OneComp: One-Line Revolution for Generative AI Model Compression

Yuma Ichikawa, Keiji Kimura, Akihiro Yoshida, Yudai Fujimoto, Hiroki Tokura, Yamato Arai, Yoshiyuki Ishii, Yusei Kawakami, Genki Shikada, Achille Jacquemond, Yoshihiko Fujisawa, Katsuki Fujisawa, Takumi Honda, Akira Sakai

Abstract

Deploying foundation models is increasingly constrained by memory footprint, latency, and hardware costs. Post-training compression can mitigate these bottlenecks by reducing the precision of model parameters without significantly degrading performance; however, its practical implementation remains challenging, as practitioners must navigate a fragmented landscape of quantization algorithms, precision budgets, data-driven calibration strategies, and hardware-dependent execution regimes. We present OneComp, an open-source compression framework that transforms this expert workflow into a reproducible, resource-adaptive pipeline. Given a model identifier and the available hardware, OneComp automatically inspects the model, plans mixed-precision assignments, and executes progressive quantization stages that advance from layer-wise compression through block-wise refinement to global refinement. A key architectural choice is treating the first quantized checkpoint as a deployable pivot, ensuring that each subsequent stage improves the same model and that quality increases as more compute is invested. By converting state-of-the-art compression research into an extensible, open-source, hardware-aware pipeline, OneComp bridges the gap between algorithmic innovation and production-grade model deployment.
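
The abstract describes a workflow that takes a model identifier and a hardware description and returns a deployment-ready quantized checkpoint. The sketch below is a hypothetical illustration of what such a one-line entry point could look like, not the actual OneComp API: the module name `onecomp`, the function `compress`, and all parameter names are assumptions made for exposition.

```python
# Hypothetical usage sketch; NOT the real OneComp API. The module,
# function, and parameter names below are illustrative assumptions
# based on the workflow described in the abstract.
from onecomp import compress  # assumed entry point

# One call: inspect the model, plan mixed-precision bit assignments for
# the available hardware, then run the progressive stages (layer-wise
# compression -> block-wise refinement -> global refinement).
quantized_model = compress(
    model_id="meta-llama/Meta-Llama-3-8B",  # model identifier (illustrative)
    hardware="1x A100 40GB",                # available hardware (illustrative)
    avg_bits=4.16,                          # target average bits per weight (illustrative)
)
quantized_model.save_pretrained("llama3-8b-w4g128")
```

Under the pivot-based design described in the abstract, the checkpoint produced by the first stage would already be deployable, and rerunning with a larger compute budget would refine that same checkpoint rather than start over.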



Figures (6)

  • Figure 1: Overview of the OneComp pipeline. Top: end-to-end workflow from a pretrained foundation model to a deployment-ready quantized model. Bottom: the three quantization granularity levels on a Transformer with mixed-precision bit allocation. Cell colors indicate per-layer bit-widths assigned by AutoBit; darker colors denote lower precision. Layer-wise PTQ operates on one linear layer; block-wise PTQ on one Transformer block; global PTQ on the full model.
  • Figure 2: Comparison of 4-bit and 2-bit uniform quantization on the same weight distribution. Each panel overlays the quantization grid on a histogram of weights. 4-bit uniform quantization provides 16 fine-grained levels; 2-bit uniform quantization provides only 4, yielding coarser approximation and larger errors.
  • Figure 3: Three levels of quantization granularity for a weight matrix. Each color represents a group of elements sharing the same scale and zero-point. Per-tensor uses one set of parameters for the entire matrix; per-channel uses one per row; per-group divides each row into smaller groups.
  • Figure 4: Two quantization formats supported by OneComp. Left: GPTQ stores per-group integer weights $\bm{Q}$, scales $\bm{s}$, and zero-points $\bm{z}$, and reconstructs weights via $\hat{W}_{ij} = s_g(q_{ij} - z_g)$; a minimal sketch of this per-group reconstruction follows the list. Right: MDBF represents the weight matrix using shared binary sign bases $S_a, S_b$ together with low-rank real-valued envelopes $AP^\top$ and $BG^\top$, yielding $\hat{W} = (S_a \odot AP^\top)(S_b \odot BG^\top)^\top$.
  • Figure 5: Example module-wise bit allocation on Llama 3 under a 4.16 average bits-per-weight (bpw) budget, which is equivalent to 4-bit quantization with a group size of 128. Top: naive mixed-precision allocation. Bottom: activation-aware allocation using diagonal activation statistics only. The activation-aware variant tends to reserve higher precision for modules with stronger activation outliers or higher activation sensitivity.
  • ...and 1 more figure
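
To make the quantization notions in the captions above concrete, the NumPy sketch below applies per-group uniform quantization (Figure 3, per-group granularity) and reconstructs weights via $\hat{W}_{ij} = s_g(q_{ij} - z_g)$ as in Figure 4 (left), then contrasts the 16-level 4-bit grid with the 4-level 2-bit grid from Figure 2. This is an illustration using standard asymmetric min-max quantization, not OneComp's implementation.

```python
import numpy as np

def quantize_per_group(W, bits, group_size=128):
    """Per-group asymmetric uniform quantization and reconstruction.

    Each group of `group_size` consecutive weights within a row shares one
    scale s_g and zero-point z_g (Figure 3, per-group). Reconstruction
    follows the GPTQ-style formula in Figure 4: W_hat = s_g * (q - z_g).
    """
    rows, cols = W.shape
    assert cols % group_size == 0, "columns must divide evenly into groups"
    G = W.reshape(rows, cols // group_size, group_size)
    lo = G.min(axis=-1, keepdims=True)
    hi = G.max(axis=-1, keepdims=True)
    qmax = 2**bits - 1                         # codes 0..qmax: 16 levels at 4 bits, 4 at 2 bits
    s = (hi - lo) / qmax                       # per-group scale
    z = np.round(-lo / s)                      # per-group zero-point
    q = np.clip(np.round(G / s + z), 0, qmax)  # integer codes
    return (s * (q - z)).reshape(rows, cols)   # dequantized weights

rng = np.random.default_rng(0)
W = rng.normal(size=(256, 512)).astype(np.float32)
for bits in (4, 2):
    err = np.abs(W - quantize_per_group(W, bits)).mean()
    print(f"{bits}-bit ({2**bits} levels): mean |W - W_hat| = {err:.4f}")
```

Running this shows the 2-bit grid's reconstruction error is several times larger than the 4-bit grid's, which is the contrast Figure 2 visualizes. The 4.16 bpw budget in Figure 5 is also consistent with this storage layout if each group of 128 weights carries a 16-bit scale and a 4-bit zero-point on top of its 4-bit codes, i.e. $4 + (16 + 4)/128 \approx 4.16$ bits per weight; the exact overhead breakdown is an assumption here, since the caption does not specify it.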