677M-parameter multilingual text embedding model scoring 71.7 on MTEB English v2 and 67.7 on MMTEB. Highest performance among multilingual embedding models under 1B parameters. Built on Qwen3-0.6B-Base with distillation from Qwen3-Embedding-4B. Supports 119+ languages, 32K tokens, 1024-dim with Matryoshka truncation. Robust under truncation and binary quantization.
Modalities
Dimensions
1K
Max tokens
33K
Parameters
677M
Price / 1M tokens
—
Type
| MTEB ENGLISH V2 | 71.70 |
| MMTEB | 67.70 |
| Release date | 2026-02-18 |
| License | CC BY-NC 4.0 |
| Model ID | jina-embeddings-v5-text-small |
| Provider | Jina AI |