| SPECIFICATIONS | |||||
| Max dimension | 2K | 2K | 4K | 1K | |
| Max tokens | 128K | 33K | 33K | 8K | |
| Parameters | — | 3.8B | 8B | 559M | |
| License | Proprietary | Apache 2.0 | Apache 2.0 | CC BY-NC 4.0 | |
| EMBEDDING TYPES | |||||
| Types | Dense | Dense, Late | Dense | Dense | |
| MATRYOSHKA DIMENSIONS | |||||
| Available sizes | 256, 512, 1024, 1536 | 128, 256, 512, 1024, 2048 | 32, 64, 128, 256, 512, 1024, 2048, 4096 | 32, 64, 128, 256, 512, 1024 | |
| INPUT MODALITIES | |||||
| Text | ✓ | ✓ | ✓ | ✓ | |
| Image | ✓ | ✓ | — | — | |
| ✓ | ✓ | — | — | ||
| OUTPUT TYPES | |||||
| Single vector | ✓ | ✓ | ✓ | ✓ | |
| Multi vector | — | ✓ | — | — | |
| LANGUAGE SUPPORT | |||||
| Languages | 🌍 100+ | 🌍 29+ | 🌍 100+ | 🌍 89+ | |
| PRICING | |||||
| Per 1M tokens | — | $0.050 | — | $0.020 | |
| BENCHMARKS | |||||
| CMTEB CHINESE | — | — | 73.84 | — | |
| COIR | — | 71.59 | — | — | |
| JINA VDR | — | 84.11 | — | — | |
| LONGEMBED | — | 67.11 | — | 55.66 | |
| MMTEB | — | 66.49 | — | 58.58 | |
| MTEB EN | — | 55.97 | — | 54.33 | |
| MTEB EN V2 | — | — | 75.22 | — | |
| MTEB MULTILINGUAL | — | — | 70.58 | — | |
| VIDORE | — | 90.17 | — | — | |
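
The Matryoshka sizes listed above are typically used by keeping only the first N components of the full dense vector. The sketch below illustrates that idea under stated assumptions: the function name `truncate_embedding` and the sample vector are hypothetical, the size list is copied from the third model column, and the L2 renormalization step assumes the vectors are compared with cosine similarity. It is not taken from any of the listed models' own tooling.

```python
import math

# Sizes listed in the table for the third model column (assumption: every
# listed size is a valid truncation point for that model's dense vector).
AVAILABLE_SIZES = (32, 64, 128, 256, 512, 1024, 2048, 4096)


def truncate_embedding(vector, size, allowed=AVAILABLE_SIZES):
    """Keep the first `size` components and L2-normalize the result."""
    if size not in allowed:
        raise ValueError(f"{size} is not one of the listed sizes: {allowed}")
    if size > len(vector):
        raise ValueError("requested size exceeds the vector length")
    head = vector[:size]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # all-zero prefix: nothing to normalize
    return [x / norm for x in head]


# Hypothetical 4096-dimensional embedding, used only for illustration.
full = [math.sin(i + 1) for i in range(4096)]
small = truncate_embedding(full, 256)
```

Smaller sizes reduce storage and search cost in proportion to the dimension; how much retrieval quality is lost at each size depends on the model and is not stated in the table.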