OpenAI: GPT-4o Audio
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
Parameters
~1.8T
Context Window
128,000
tokens
Input Price
$2.5
per 1M tokens
Output Price
$10
per 1M tokens
Capabilities
Model capabilities and supported modalities
Performance
Excellent reasoning capabilities with strong logical analysis
Strong mathematical capabilities, handles complex calculations well
Strong coding abilities across multiple programming languages
Extensive knowledge base with broad coverage of topics
Modalities
audio,text
text,audio
LLM Price Calculator
Calculate the cost of using this model
Monthly Cost Estimator
Based on different usage levels
