Balanced for intelligence, speed, and cost that makes it an attractive model for many use cases.
Specifications
Context1,047,576
Max Output32,768
Inputtext, image
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
Pricing
Input$0.40×1.1/MTokens
Output$1.60×1.1/MTokens
Cached Input$0.10×1.1/MTokens