How do you use GPT-5.4 Image 2?

GPT-5.4 Image 2 can be called through OpenRouter's OpenAI-compatible API using the model ID openai/gpt-5.4-image-2. This page includes a ready-to-run curl example and the supported parameter list.

Is GPT-5.4 Image 2 free?

GPT-5.4 Image 2 is a paid model in the OpenRouter catalog, with separate input and output token pricing shown on this page.

GPT-5.4 Image 2

All models

OpenAIGPT ModeratedReleased 2026-04-21

GPT-5.4 Image 2

Q: What is GPT-5.4 Image 2?

GPT-5.4 Image 2 is an AI model from OpenAI in the GPT series. Agent Mag tracks its context window, pricing, modalities, and supported API parameters on this page.

272K context

$8.00/M input

$15.00/M output

GPT-5.4 Image 2 is a multimodal AI model that combines OpenAI's GPT-5.4 capabilities with advanced image generation features from GPT Image 2. It supports seamless workflows across reasoning, coding, and visual generation, enabling users to interact with both text and image modalities within the same session. The model is designed for high-context tasks and multimodal analysis, making it suitable for diverse applications.

What is GPT-5.4 Image 2?

GPT-5.4 Image 2 is an AI model from OpenAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare GPT-5.4 Image 2 against other models for agent workflows and production deployments.

Model ID

Architecture & Specifications

Tokenizer

GPT

License

Proprietary

Released

2026-04-21

Modalities

Input

imagetextfile

Output

imagetext

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokenspresence_penaltyreasoningresponse_formatseedstopstructured_outputstop_logprobs

Strengths

Supports multimodal workflows with text and image generation
High-context reasoning and coding capabilities
Seamless integration of visual and textual outputs
Optimized for diverse applications requiring multimodal analysis

Limitations

No information on training data or knowledge cutoff
Structured output error rate of 4.84%
High token costs for input and output processing

Recommended Use Cases

Generating images from text prompts

Multimodal analysis combining text and visuals

Coding and reasoning tasks with visual outputs

Interactive workflows requiring both text and image modalities

High-context document understanding and synthesis

More from OpenAI

GPT-5.4 Nano

400K ctx$0.200/M

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...

GPT-5.4 Mini

400K ctx$0.750/M

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

GPT-5.4 Pro

1.1M ctx$30.00/M

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

GPT-5.4

1.1M ctx$2.50/M

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...

Stay in the know

GPT-5.4 Image 2

What is GPT-5.4 Image 2?

More from OpenAI

Related content