The gpt-audio model is our first generally available audio model. It accepts audio inputs and outputs, and can be used in the Chat Completions REST API.
Specifications
Context128,000
Max Output16,384
Inputtext, audio
Outputtext, audio
Performance (7-day Average)
Uptime
TPS
RURT
Pricing
Input$2.50×1.1/MTokens
Output$10.00×1.1/MTokens
Input Audio$40.00×1.1/MTokens
Output Audio$80.00×1.1/MTokens