jina-embeddings-v3: Multilingual Embeddings With Task LoRA
Saba Sturua, Isabelle Mohr, Mohammad Kalim Akram, Michael Günther, Bo Wang, Markus Krimmel, Feng Wang, Georgios Mastrapas, Andreas Koukounas, Nan Wang, Han Xiao
TL;DR
Jina-embeddings-v3 delivers multilingual, long-context embeddings with a compact 570M parameter backbone enhanced by task-specific LoRA adapters and Matryoshka representation learning. The approach combines RoPE-based long-context encoding, a frozen XLM-RoBERTa foundation, and dedicated adapters for retrieval, clustering, classification, and text matching, achieving 1024-dimensional embeddings with scalable performance. Evaluations on MTEB show strong monolingual English results and competitive multilingual performance, surpassing several proprietary multilingual embeddings while offering substantial cost advantages over large LLM-based alternatives. The work also analyzes retrieval failures and demonstrates robust improvements via synthetic data augmentation and ablation studies on embedding dimension and retrieval asymmetry, highlighting practical potential for production and edge deployment.
Abstract
We introduce jina-embeddings-v3, a novel text embedding model with 570 million parameters, achieves state-of-the-art performance on multilingual data and long-context retrieval tasks, supporting context lengths of up to 8192 tokens. The model includes a set of task-specific Low-Rank Adaptation (LoRA) adapters to generate high-quality embeddings for query-document retrieval, clustering, classification, and text matching. Evaluation on the MTEB benchmark shows that jina-embeddings-v3 outperforms the latest proprietary embeddings from OpenAI and Cohere on English tasks, while achieving superior performance compared to multilingual-e5-large-instruct across all multilingual tasks. With a default output dimension of 1024, users can flexibly reduce the embedding dimensions to as low as 32 without compromising performance, enabled by Matryoshka Representation Learning.
