1.74B-parameter multimodal omni embedding model accepting text, images, video, and audio with shared vector space aligned to text-only embeddings. Supports 4 tasks (retrieval, classification, clustering, text-matching) with task-specific adapters. 1024-dim embeddings with Matryoshka truncation down to 32 dims. Built on Qwen3 architecture with frozen-tower composition.
Modalities
Dimensions
1K
Max tokens
33K
Parameters
1.74B
Price / 1M tokens
—
Type
| Release date | 2026-05-01 |
| License | CC BY-NC 4.0 |
| Model ID | jina-embeddings-v5-omni-small |
| Provider | Jina AI |