GPT-4o Audio
All models
OpenAIOpenAIGPT Moderated

GPT-4o Audio

128K context$2.50/M input$10.00/M output

GPT-4o Audio is an AI model from OpenAI built for agent workflows, with support for audio, text input and text, audio output. The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

What is GPT-4o Audio?

GPT-4o Audio is an AI model from OpenAI that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare GPT-4o Audio against other models for agent workflows and production deployments.

Model ID

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

Modalities
Input
audiotext
Output
textaudio
Supported Parameters
frequency_penaltylogit_biaslogprobsmax_tokenspresence_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_logprobstop_p

Related content