glm-5-turbo

Common Name: GLM-5 Turbo

ChatGLM
Released on Mar 16 12:00 AMKnowledge Cutoff Apr 1, 2025 12:00 AMSupportedTool InvocationSupportedReasoning
CompareTry in Chat

GLM-5-Turbo is the fast variant of GLM-5, optimized for lower latency and cost while retaining strong agentic and coding performance.

Specifications

Context
200K
Maximum Output
128K
Inputtext
Outputtext

Performance (7-day Average)

Collecting…
Collecting…
Collecting…

Pricing

< 32K
Input¥5.50/MTokens
Output¥24.20/MTokens
Cached Input¥1.32/MTokens
32K-200K
Input¥7.70/MTokens
Output¥28.60/MTokens
Cached Input¥1.98/MTokens

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

Free/Free
ctx131Kmax98Kavailtps
InOutCap

Fast, cost-efficient version of GLM-4.5. Optimized for high-throughput applications.

¥4.40/¥13.20/M
ctx128Kmaxavailtps
InOutCap

Zhipu AI's GLM-4.5 AirX variant optimized for high-speed inference.

¥0.88/¥2.20/M
ctx131Kmax98Kavailtps
InOutCap

Zhipu AI's lightweight GLM-4.5 variant for cost-effective tasks.

¥8.80/¥17.60/M
ctx128Kmaxavailtps
InOutCap

Zhipu AI's GLM-4.5 X variant with enhanced performance.