Embedding Models/GTE-Qwen2-7B-instruct

GTE-Qwen2-7B-instruct by Alibaba — 3584D

Top-ranked embedding model on MTEB English and Chinese. Built on Qwen2-7B with bidirectional attention and instruction tuning.

At a glance

Modalities

Dimensions

4K

Max tokens

32K

Parameters

7B

Price / 1M tokens

Type

Dense

Output types

Single VectorMulti Vector

Language support

🌍 Multilingual support

Benchmarks

MTEB 56 70.24
CMTEB 35 72.05
MTEB FR 68.25
MTEB PL 67.86

Details

Release date 2024-06-16
License Apache 2.0
Model ID gte-qwen2-7b-instruct
Provider Alibaba

Tags

text-embeddingmultilingualinstruction-awarelong-contextopen-source

What You Need to Know About GTE-Qwen2-7B-instruct

Complete Specifications for GTE-Qwen2-7B-instruct by Alibaba

Get detailed specifications for GTE-Qwen2-7B-instruct, including output dimensionality of 3584 dimensions, maximum input token length, supported input modalities, and pricing per million tokens. This embedding model is designed for semantic search, text classification, clustering, and retrieval-augmented generation applications where understanding the relationship between texts is essential.

Pricing and Cost Efficiency for GTE-Qwen2-7B-instruct

Review the pricing structure for GTE-Qwen2-7B-instruct and compare it against other embedding models from Alibaba and competitors. Understanding embedding model costs is essential when scaling vector search and RAG applications to millions of documents. We provide transparent pricing to help you budget effectively.

Use Cases and Applications for GTE-Qwen2-7B-instruct

Explore the ideal use cases for GTE-Qwen2-7B-instruct. Whether you are building a semantic search engine, recommendation system, document classification pipeline, or multilingual retrieval system, understanding this model's capabilities and dimensionality will help you choose the right embedding strategy for your project.