Embedding Models/mxbai-embed-large-v1

mxbai-embed-large-v1 by Mixedbread — 1024D

SOTA BERT-large sized embedding model. Outperforms OpenAI text-embedding-3-large and matches models 20x its size. Supports Matryoshka and binary quantization.

At a glance

Modalities

Dimensions

1K

Max tokens

512

Parameters

335M

Price / 1M tokens

Type

Dense

Matryoshka dimensions

641282565121024

Output types

Single VectorMulti Vector

Language support

en

Benchmarks

MTEB AVG 64.68
MTEB RETRIEVAL 54.39
MTEB STS 85.00

Details

Release date 2024-03-01
License Apache 2.0
Model ID mxbai-embed-large-v1
Provider Mixedbread

Tags

text-embeddingmatryoshkabinary-quantizationopen-source