Sabiá-3 Technical Report

Hugo Abonizio; Thales Sales Almeida; Thiago Laitz; Roseval Malaquias Junior; Giovana Kerche Bonás; Rodrigo Nogueira; Ramon Pires

Sabiá-3 Technical Report

Hugo Abonizio, Thales Sales Almeida, Thiago Laitz, Roseval Malaquias Junior, Giovana Kerche Bonás, Rodrigo Nogueira, Ramon Pires

TL;DR

This paper introduces Sabiá-3 and Sabiazinho-3, Brazil-focused language models trained on a large Brazilian Portuguese corpus to leverage domain-specific linguistic and cultural knowledge. The authors employ a two-phase training regime—pre-training on specialized data with next-token prediction followed by instruction tuning and preference alignment—implemented with TPU v5 and Jax to scale data and model parallelism. Results show Sabiá-3 achieves competitive performance with frontier LLMs on knowledge-intensive tasks at a substantially lower cost per token, while outperforming its predecessor in reasoning and long-context processing; however, it still trails top-tier models on multi-step tasks and some instruction-following benchmarks. The findings highlight the practical value of domain specialization for cost-effective, high-signal performance in Brazil-centric applications, and point to future work in enhancing multi-turn instruction-following and agentic capabilities.

Abstract

This report presents Sabiá-3, our new flagship language model, and Sabiazinho-3, a more cost-effective sibling. The models were trained on a large brazilian-centric corpus. Evaluations across diverse professional and academic benchmarks show a strong performance on Portuguese and Brazil-related tasks. Sabiá-3 shows large improvements in comparison to our previous best of model, Sabia-2 Medium, especially in reasoning-intensive tasks. Notably, Sabiá-3's average performance matches frontier LLMs, while it is offered at a three to four times lower cost per token, reinforcing the benefits of domain specialization.

Sabiá-3 Technical Report

TL;DR

Abstract

Sabiá-3 Technical Report

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)