677M-parameter text-matching-targeted variant of v5-text-small. Optimized for symmetric pairwise similarity scoring, STS, paraphrase, and near-duplicate detection. 1024-dim embeddings with Matryoshka truncation. Supports 119+ languages up to 32K tokens. Available in GGUF, ONNX, and BF16 formats. Compatible with vLLM, TEI, llama.cpp, and sentence-transformers.
Modalities
Dimensions
1K
Max tokens
33K
Parameters
677M
Price / 1M tokens
—
Type
| Release date | 2026-02-18 |
| License | CC BY-NC 4.0 |
| Model ID | jina-embeddings-v5-text-small-text-matching |
| Provider | Jina AI |