LLM Models/Gemma 4 31B

Gemma 4 31B by Google DeepMind — 256K Context

30.7B-parameter dense multimodal model from Google DeepMind with 256K token context. Handles text, image, and video input with text output. Features hybrid attention (sliding window + global), configurable thinking mode, native function calling, and multilingual support in 140+ languages. Optimized for coding, reasoning, and agentic workflows.

At a glance

Modalities

Context window

256,000

Pricing

/

input / output per 1M

Reasoning

Enabled

Capabilities

Streaming

Real-time token-by-token response streaming

Function calling

Connect the model to external tools and systems

Structured outputs

Return responses in JSON schema format

Fine-tuning

Custom model training on your data

Reasoning

Extended thinking before responding

Benchmarks

MMLU PRO 85.2
AIME 2026 89.2
LIVECODEBENCH V6 80.0
CODEFORCES ELO 2150
GPQA DIAMOND 84.3
TAU2 AVG 76.9
HLE NO TOOLS 19.5
HLE WITH SEARCH 26.5
BIGBENCH EXTRA HARD 74.4
MMMLU 88.4
MMMU PRO 76.9
OMNIDOCBENCH 1 5 0.131
MATH VISION 85.6
MEDXPERTQA MM 61.3
LONG CONTEXT MRCR V2 66.4

Details

Release date 2026-05-01
Model ID gemma-4-31b
Provider Google DeepMind

What You Need to Know About Gemma 4 31B

Complete Overview of Gemma 4 31B by Google DeepMind

Get detailed information about Gemma 4 31B, including its context window of 256000 tokens, pricing per million tokens, supported input and output modalities, and benchmark scores. This model from Google DeepMind offers specific capabilities for natural language processing, code generation, and complex reasoning tasks that set it apart from alternatives.

Pricing and Cost Analysis for Gemma 4 31B

Compare input and output token pricing for Gemma 4 31B against other models in its class. Understanding LLM pricing is essential for budgeting your AI applications at scale. We break down the cost per million tokens for both input and output so you can estimate the total cost of your workloads and compare value across providers.

Benchmarks and Performance Metrics for Gemma 4 31B

Review benchmark performance data for Gemma 4 31B across key evaluation metrics. Compare its reasoning, coding, and language understanding capabilities against competing models to determine if it is the right fit for your specific requirements, whether that involves complex analysis, creative generation, or efficient inference at scale.