Jina Embeddings v5 Text Small Text-Matching by Jina AI — 1024D

A 677M-parameter variant of v5-text-small tuned for text matching. It is optimized for symmetric pairwise similarity scoring, semantic textual similarity (STS), paraphrase detection, and near-duplicate detection. It produces 1024-dimensional embeddings that support Matryoshka truncation, and it handles 119+ languages with inputs of up to 32K tokens. Available in GGUF, ONNX, and BF16 formats. Compatible with vLLM, TEI, llama.cpp, and sentence-transformers.
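
As a rough illustration of the symmetric scoring and near-duplicate use case described above, the sketch below computes a pairwise cosine-similarity matrix and flags pairs above a threshold. It is not taken from the model's documentation: the helper names, the 0.9 threshold, and the random stand-in vectors (used in place of real model output) are all assumptions chosen for the example.

```python
import numpy as np


def cosine_similarity_matrix(embeddings: np.ndarray) -> np.ndarray:
    """Return the symmetric matrix of pairwise cosine similarities."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)  # guard against zero vectors
    return unit @ unit.T


def near_duplicate_pairs(embeddings: np.ndarray, threshold: float = 0.9):
    """List index pairs (i, j), i < j, whose similarity meets the threshold.

    The 0.9 default is a hypothetical value; tune it on your own data.
    """
    sims = cosine_similarity_matrix(embeddings)
    n = sims.shape[0]
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if sims[i, j] >= threshold]


# Stand-in vectors: random 1024-dim data, NOT real model embeddings.
rng = np.random.default_rng(0)
base = rng.normal(size=1024)
embeddings = np.stack([
    base,                                  # text A
    base + 0.01 * rng.normal(size=1024),   # slightly perturbed copy of A
    rng.normal(size=1024),                 # unrelated text
])

sims = cosine_similarity_matrix(embeddings)
pairs = near_duplicate_pairs(embeddings, threshold=0.9)
print(pairs)
```

Because text matching is symmetric, both texts are embedded the same way and the score for (A, B) equals the score for (B, A), so only the upper triangle of the matrix needs to be scanned.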

At a glance

Modalities

Text

Dimensions

1024

Max tokens

33K

Parameters

677M

Price / 1M tokens

Type

Dense

Matryoshka dimensions

32, 64, 128, 256, 512, 768, 1024
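
Matryoshka truncation generally means keeping only the leading components of the full embedding and re-normalizing the result. A minimal sketch follows, assuming the supported sizes run from 32 up to 1024 as listed here. The function name and the random stand-in vectors are hypothetical and only serve the illustration.

```python
import numpy as np

# Truncation sizes as listed on this page (assumed to be the supported set).
MATRYOSHKA_DIMS = (32, 64, 128, 256, 512, 768, 1024)


def truncate_embedding(embedding: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` components and rescale to unit length."""
    if dim not in MATRYOSHKA_DIMS:
        raise ValueError(f"dim must be one of {MATRYOSHKA_DIMS}, got {dim}")
    truncated = embedding[..., :dim]
    norm = np.linalg.norm(truncated, axis=-1, keepdims=True)
    return truncated / np.clip(norm, 1e-12, None)  # guard against zero vectors


# Stand-in for two full-size embeddings; NOT real model output.
rng = np.random.default_rng(0)
full = rng.normal(size=(2, 1024))

small = truncate_embedding(full, 256)
print(small.shape)  # (2, 256)
```

Smaller sizes reduce storage and speed up similarity search, usually at some cost in accuracy, so the right size depends on the workload.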

Output types

Dense, Late Interaction

Language support

multilingual

Details

Release date 2026-02-18
License CC BY-NC 4.0
Model ID jina-embeddings-v5-text-small-text-matching
Provider Jina AI

Tags

text, text-matching, sentence-similarity