GPT-5.4 Mini
All models
OpenAIOpenAIGPTReleased March 17, 2026

GPT-5.4 Mini

400K context$0.750/M input$4.50/M output

GPT-5.4 Mini is a streamlined version of OpenAI's GPT-5.4 model, optimized for high-throughput workloads. It supports both text and image inputs and excels in reasoning, coding, and tool use while reducing latency and cost for large-scale deployments. The model is designed for production environments requiring a balance of capability and efficiency, making it suitable for applications like chatbots, coding assistants, and agent workflows.

What is GPT-5.4 Mini?

GPT-5.4 Mini is an AI model from OpenAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare GPT-5.4 Mini against other models for agent workflows and production deployments.

Model ID

GPT-5.4 Mini is a streamlined version of OpenAI's GPT-5.4 model, optimized for high-throughput workloads. It supports both text and image inputs and excels in reasoning, coding, and tool use while reducing latency and cost for large-scale deployments. The model is designed for production environments requiring a balance of capability and efficiency, making it suitable for applications like chatbots, coding assistants, and agent workflows.

Architecture & Specifications
Tokenizer
GPT
Training Data
Knowledge cutoff August 31, 2025
License
Proprietary
Released
March 17, 2026
Modalities
Input
fileimagetext
Output
text
Supported Parameters
include_reasoningmax_completion_tokensmax_tokensreasoningresponse_formatseedstructured_outputstool_choicetools
Strengths
  • Supports both text and image inputs
  • Optimized for high-throughput workloads
  • Reliable instruction following
  • Solid multi-step reasoning
  • Improved cost efficiency for large-scale deployments
Limitations
  • Hallucination rate of 10.2% in knowledge tasks
  • Lower performance in research-level physics reasoning (10.0%)
  • Limited accuracy in economically valuable tasks (46.7%)
  • Moderate coding capability compared to specialized models
  • Latency varies across providers
Recommended Use Cases
Chat applications
Coding assistants
Agent workflows
Large-scale deployments
Multimodal tasks combining text and image inputs

Related content

Data enriched Apr 24, 2026. Pricing from OpenRouter API.