OpenAI: GPT Audio

GPT

Multimodal

Paid

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency. Audio is priced...

Parameters

Context Window: 128,000 tokens

Input Price: $2.50 per 1M tokens

Output Price: $10.00 per 1M tokens

Capabilities

Model capabilities and supported modalities

Performance

Reasoning

Math

Coding

Specialized in code generation with excellent programming capabilities

Knowledge

Modalities

Input Modalities: text, audio

Output Modalities: text, audio

LLM Price Calculator

Calculate the cost of using this model

Input Tokens ($0.00000250/token): $0.003750

Output Tokens ($0.00001000/token): $0.030000

Total Cost: $0.033750

Estimated usage: 4,500 tokens
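The calculator figures above can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: `request_cost` is a hypothetical helper, and the 1,500 input / 3,000 output token split is inferred by dividing the listed costs by the per-token rates (the page states only the 4,500-token total). It applies the listed per-token rates uniformly and does not model any separate audio pricing that the truncated description may refer to.

```python
from decimal import Decimal

# Listed rates from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("2.50")
OUTPUT_PRICE_PER_M = Decimal("10.00")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> dict:
    """Return the input, output, and total cost in USD for one request.

    Hypothetical helper: applies the listed per-token rates only.
    """
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# ASSUMPTION: 1,500 input + 3,000 output tokens, inferred from the listed costs.
cost = request_cost(1_500, 3_000)
print(f"Input Cost:  ${cost['input']:.6f}")
print(f"Output Cost: ${cost['output']:.6f}")
print(f"Total Cost:  ${cost['total']:.6f}")
```

Using `Decimal` rather than floats keeps the six-decimal figures exact, which matters when costs this small are later multiplied by large request counts.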

Monthly Cost Estimator

Based on different usage levels

Light Usage: $0.1250 (~10 requests)

Moderate Usage: $1.2500 (~100 requests)

Heavy Usage: $12.5000 (~1,000 requests)

Enterprise: $125.0000 (~10,000 requests)

Note: Estimates are based on the current token count settings per request.
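The tier figures scale linearly with request count and work out to $0.0125 per request. As a rough sketch, the code below reproduces them under the assumption of 1,000 input and 1,000 output tokens per request. That split is a guess that happens to match the listed totals at the listed rates; the page does not state the per-request token settings. `monthly_cost` and the constant names are hypothetical.

```python
from decimal import Decimal

# Listed rates from this page, converted to USD per token.
INPUT_RATE = Decimal("2.50") / Decimal(1_000_000)
OUTPUT_RATE = Decimal("10.00") / Decimal(1_000_000)

# ASSUMPTION: per-request token counts chosen to match the listed tier totals.
INPUT_TOKENS_PER_REQUEST = 1_000
OUTPUT_TOKENS_PER_REQUEST = 1_000

# Request counts per tier, as listed on the page.
TIERS = {
    "Light Usage": 10,
    "Moderate Usage": 100,
    "Heavy Usage": 1_000,
    "Enterprise": 10_000,
}


def monthly_cost(request_count: int) -> Decimal:
    """Estimated monthly cost in USD for a given number of requests."""
    per_request = (
        INPUT_RATE * INPUT_TOKENS_PER_REQUEST
        + OUTPUT_RATE * OUTPUT_TOKENS_PER_REQUEST
    )
    return per_request * request_count


for name, request_count in TIERS.items():
    print(f"{name}: ${monthly_cost(request_count):.4f} (~{request_count:,} requests)")
```

Changing the two per-request constants shows how sensitive the monthly figure is to output length, since output tokens cost four times as much as input tokens at the listed rates.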

Last Updated: 2026/05/08