LLM Models/Grok 4.1 Fast

Grok 4.1 Fast by xAI — 2.0M Context

A frontier multimodal model optimized specifically for high-performance agentic tool calling.

At a glance

Modalities

Context window

2,000,000

Pricing

$3.00 / $15.00

input / output per 1M

Reasoning

Enabled
💸Price Calculator
Input1.0M
$3.00
Output0.5M
$7.50
Total: $10.50
10Coffee
🥇
0.1Gold (g)
🍕
4.2Pizza
🐄
0.3%Cow
🎮
0.4%RTX 5090

Capabilities

Streaming

Real-time token-by-token response streaming

Function calling

Connect the model to external tools and systems

Structured outputs

Return responses in JSON schema format

Reasoning

The model thinks before responding

Caching

Cache responses to reduce latency and costs

Web search

Search the internet for real-time information

Live search

Real-time web search with source citations

Details

Model ID grok-4.1-fast
Provider xAI

Rate limits

Tier RPM TPM Batch queue
Default 480 4,000,000