How do you use GPT-4o Audio?

GPT-4o Audio can be called through OpenRouter's OpenAI-compatible API using the model ID openai/gpt-4o-audio-preview. This page includes a ready-to-run curl example and the supported parameter list.

Is GPT-4o Audio free?

GPT-4o Audio is a paid model in the OpenRouter catalog, with separate input and output token pricing shown on this page.

GPT-4o Audio

All models

OpenAIGPT Moderated

GPT-4o Audio

Q: What is GPT-4o Audio?

GPT-4o Audio is an AI model from OpenAI in the GPT series. Agent Mag tracks its context window, pricing, modalities, and supported API parameters on this page.

128K context

$2.50/M input

$10.00/M output

GPT-4o Audio is an AI model from OpenAI built for agent workflows, with support for audio, text input and text, audio output. The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

What is GPT-4o Audio?

GPT-4o Audio is an AI model from OpenAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare GPT-4o Audio against other models for agent workflows and production deployments.

Model ID

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

Modalities

Input

audiotext

Output

textaudio

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokenspresence_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_logprobstop_p

More from OpenAI

GPT-5.4 Image 2

272K ctx$8.00/M

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

GPT-5.4 Nano

400K ctx$0.200/M

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...

GPT-5.4 Mini

400K ctx$0.750/M

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

GPT-5.4 Pro

1.1M ctx$30.00/M

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

Stay in the know

GPT-4o Audio

What is GPT-4o Audio?

More from OpenAI

Related content