Back to Models

gpt-audio

gpt-audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

Modalities

Input

audiotext

Output

audiotext
Pricing
Cost per 1 million tokens
Input
$3
Output
$12
Model Specs
Context Window
128,000
Max Output
16,384
Release Date
2026-01-20
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2026-03-15

Provider: