Embedding Models/Qwen3-Embedding-8B

Qwen3-Embedding-8B by Qwen — 4096D

State-of-the-art multilingual text embedding model. #1 on MTEB multilingual leaderboard. Supports 100+ languages including programming languages, with flexible output dimensions from 32 to 4096.

At a glance

Modalities

Dimensions

4K

Max tokens

33K

Parameters

8B

Price / 1M tokens

—

Type

Dense

Matryoshka dimensions

3264128256512102420484096

Output types

Single VectorMulti Vector

Language support

🌍 Multilingual support (100+ languages)

Benchmarks

MTEB MULTILINGUAL	70.58
MTEB EN V2	75.22
CMTEB CHINESE	73.84

Details

Release date	2025-06-05
License	Apache 2.0
Model ID	qwen3-embedding-8b
Provider	Qwen

What You Need to Know About Qwen3-Embedding-8B

Complete Specifications for Qwen3-Embedding-8B by Qwen

Get detailed specifications for Qwen3-Embedding-8B, including output dimensionality of 4096 dimensions, maximum input token length, supported input modalities, and pricing per million tokens. This embedding model is designed for semantic search, text classification, clustering, and retrieval-augmented generation applications where understanding the relationship between texts is essential.

Pricing and Cost Efficiency for Qwen3-Embedding-8B

Review the pricing structure for Qwen3-Embedding-8B and compare it against other embedding models from Qwen and competitors. Understanding embedding model costs is essential when scaling vector search and RAG applications to millions of documents. We provide transparent pricing to help you budget effectively.

Use Cases and Applications for Qwen3-Embedding-8B

Explore the ideal use cases for Qwen3-Embedding-8B. Whether you are building a semantic search engine, recommendation system, document classification pipeline, or multilingual retrieval system, understanding this model's capabilities and dimensionality will help you choose the right embedding strategy for your project.