gpt-audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder that produces more natural-sounding voices and keeps voice characteristics more consistent. Audio is priced...
Modalities
Input
audio, text
Output
audio, text
Pricing
Cost per 1 million tokens
Input
$3
Output
$12
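As a rough illustration of the per-million-token rates above, the following sketch estimates the cost of a single request. The helper name `estimate_cost_usd` and the example token counts are hypothetical, and the sketch assumes the listed rates apply uniformly to every token counted; the description's note on audio pricing is truncated, so audio tokens may be billed differently.

```python
# Rates copied from the Pricing section (USD per 1 million tokens).
INPUT_USD_PER_MILLION = 3.0
OUTPUT_USD_PER_MILLION = 12.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate one request's cost in USD.

    Assumes the listed rates apply to all tokens; audio tokens may
    be priced differently in practice.
    """
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example with made-up token counts: 10,000 input and 2,000 output tokens.
print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0540
```

At these rates an output token costs four times as much as an input token, so the output length tends to dominate the estimate for long generated responses.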
Model Specs
Context Window
128,000
Max Output
16,384
Release Date
2026-01-20
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision
Last Updated: 2026-03-15
Provider: