Faster, cost-efficient version of GPT-5 suitable for well-defined tasks and precise prompts.
Modalities
Context window
400,000
Pricing
input / output per 1M
Reasoning
Streaming
Real-time token-by-token response streaming
Function calling
Connect the model to external tools and systems
Structured outputs
Return responses in JSON schema format
Web search
Search the internet for real-time information
File search
Search and retrieve from uploaded files
Code execution
Execute code in a sandboxed environment
| Knowledge cutoff | 2024-05-31 |
| Model ID | gpt-5-mini |
| Provider | OpenAI |
| Tier | RPM | TPM | Batch queue |
|---|---|---|---|
| Tier1 | 500 | 500,000 | 5,000,000 |
| Tier2 | 5,000 | 2,000,000 | 20,000,000 |
| Tier3 | 5,000 | 4,000,000 | 40,000,000 |
| Tier4 | 10,000 | 10,000,000 | 1,000,000,000 |
| Tier5 | 30,000 | 180,000,000 | 15,000,000,000 |