GPT Audio is an AI model from OpenAI in the GPT series. Agent Mag tracks its context window, pricing, modalities, and supported API parameters on this page.

How do you use GPT Audio?

GPT Audio can be called through OpenRouter's OpenAI-compatible API using the model ID openai/gpt-audio. This page includes a ready-to-run curl example and the supported parameter list.

GPT Audio is a paid model in the OpenRouter catalog, with separate input and output token pricing shown on this page.

GPT Audio

All models

OpenAIGPT Moderated

GPT Audio

128K context

$2.50/M input

$10.00/M output

GPT Audio is an AI model from OpenAI built for agent workflows, with support for text, audio input and text, audio output. The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

What is GPT Audio?

GPT Audio is an AI model from OpenAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare GPT Audio against other models for agent workflows and production deployments.

Model ID

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

Modalities

Input

textaudio

Output

textaudio

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokenspresence_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_logprobstop_p

More from OpenAI

GPT-5.4 Image 2

272K ctx$8.00/M

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

GPT-5.4 Nano

400K ctx$0.200/M

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...

GPT-5.4 Mini

400K ctx$0.750/M

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

GPT-5.4 Pro

1.1M ctx$30.00/M

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

Stay in the know

GPT Audio

What is GPT Audio?

More from OpenAI

Related content