Gemma 4 31B
Google · Gemma · Released April 2, 2026

262K context · $0.130/M input tokens · $0.380/M output tokens · 30.7B parameters
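
At these rates, per-request cost follows directly from token counts. A minimal sketch of the arithmetic; the token counts below are hypothetical and chosen only for illustration:

```python
# Sketch: estimating per-request cost from the listed per-million-token rates.
# The token counts are hypothetical, used only to illustrate the arithmetic.
INPUT_RATE = 0.130 / 1_000_000   # USD per input token ($0.130/M)
OUTPUT_RATE = 0.380 / 1_000_000  # USD per output token ($0.380/M)

input_tokens, output_tokens = 50_000, 2_000
cost = input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE
print(f"Estimated cost: ${cost:.5f}")  # about $0.00726 for this example
```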

Gemma 4 31B Instruct is a dense multimodal model developed by Google DeepMind with 30.7 billion parameters. It supports both text and image inputs and provides text outputs, featuring a 256K token context window, configurable reasoning modes, native function calling, and multilingual support across 140+ languages. The model excels in coding, reasoning, and document understanding tasks.
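
In practice the model is reached through an OpenAI-compatible chat completions API, for example via OpenRouter, whose pricing this page quotes. A minimal sketch of a text-plus-image request; the model ID is hypothetical, since this page does not list the exact identifier:

```python
# Sketch: text + image request through an OpenAI-compatible endpoint (OpenRouter shown).
# The model ID is hypothetical; check the provider's catalog for the exact one.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemma-4-31b-it",  # hypothetical ID for Gemma 4 31B Instruct
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Summarize this chart in two sentences."},
            {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
        ],
    }],
    max_tokens=512,
)
print(response.choices[0].message.content)
```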

What is Gemma 4 31B?

Gemma 4 31B is an AI model from Google that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Gemma 4 31B against other models for agent workflows and production deployments.
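
The same comparison can be scripted against the provider's model listing. A sketch using OpenRouter's public models endpoint; the response field names ("id", "context_length", "pricing") are assumptions about its JSON shape and should be verified against the live API:

```python
# Sketch: pull model metadata (context length, pricing) for side-by-side comparison.
# Endpoint and field names are assumptions; verify against OpenRouter's documentation.
import requests

resp = requests.get("https://openrouter.ai/api/v1/models", timeout=30)
resp.raise_for_status()

for model in resp.json().get("data", []):
    if "gemma" in model.get("id", "").lower():
        print(model["id"], model.get("context_length"), model.get("pricing"))
```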

Architecture & Specifications
Architecture: Dense Transformer
Parameters: 30.7B
Tokenizer: Gemma
License: Apache 2.0
Released: April 2, 2026
Modalities
Input: image, text, video
Output: text
Supported Parameters
frequency_penalty, include_reasoning, logit_bias, logprobs, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_k, top_logprobs, top_p
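
Most of these map onto fields of an OpenAI-style chat completions request. A short sketch exercising a few of them (sampling controls, a fixed seed, and a JSON response format); the model ID is hypothetical and parameter support should be confirmed with the serving provider:

```python
# Sketch: exercising several of the listed parameters in one request.
# Model ID is hypothetical; confirm parameter support with the serving provider.
from openai import OpenAI

client = OpenAI(base_url="https://openrouter.ai/api/v1", api_key="YOUR_OPENROUTER_API_KEY")

response = client.chat.completions.create(
    model="google/gemma-4-31b-it",            # hypothetical ID
    messages=[{"role": "user", "content": "List three long-context prompting risks as JSON."}],
    temperature=0.2,                           # sampling controls from the supported list
    top_p=0.9,
    max_tokens=400,
    seed=42,                                   # reproducible sampling where supported
    response_format={"type": "json_object"},   # structured output
)
print(response.choices[0].message.content)
```
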
Strengths
  • Supports text and image inputs with text outputs
  • 256K token context window
  • Multilingual support across 140+ languages
  • Excels in coding, reasoning, and document understanding tasks
  • Configurable reasoning modes and native function calling
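
The function-calling support above is exposed through the standard tools interface of OpenAI-style requests. A hedged sketch with a made-up weather tool and a hypothetical model ID:

```python
# Sketch: native function calling via the tools / tool_choice parameters.
# The get_weather tool and the model ID are hypothetical examples.
from openai import OpenAI

client = OpenAI(base_url="https://openrouter.ai/api/v1", api_key="YOUR_OPENROUTER_API_KEY")

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="google/gemma-4-31b-it",  # hypothetical ID
    messages=[{"role": "user", "content": "What's the weather in Nairobi right now?"}],
    tools=tools,
    tool_choice="auto",
)
print(response.choices[0].message.tool_calls)
```
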
Limitations
  • Low performance on research-level physics reasoning (CritPt: 1.4%)
  • Moderate hallucination rate in knowledge tasks (18.4%)
  • Limited accuracy in economically valuable tasks (GDPval-AA: 30.9%)
  • Relatively low omniscience accuracy (19.9%)
  • Performance varies significantly across benchmarks
Recommended Use Cases
Coding and software development
Document understanding and summarization
Multilingual communication and translation
Scientific reasoning and analysis
Legal and financial document processing

Data enriched Apr 24, 2026. Pricing from OpenRouter API.