deepseek-v4-flash

Common Name: DeepSeek V4 Flash

Tool Invocation: Supported

Reasoning: Supported

DeepSeek V4 Flash is a hybrid-thinking model with 1M context and 384K max output. It supports both non-thinking and thinking modes, with thinking enabled by default.

Specifications

Context: 1000K

Maximum Output: 384K

Input: text

Output: text

Performance (7-day Average)

Collecting…

Pricing

Input: ¥1.00/MTokens

Cached Input: ¥0.02/MTokens

Output: ¥2.00/MTokens
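
To make the per-million-token rates concrete, here is a minimal Python sketch that estimates the cost of one request from the prices listed above. The function name `estimate_cost_cny` is hypothetical, and the sketch assumes that cached input tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate; the page itself does not spell out that billing rule.

```python
from decimal import Decimal

# Prices in CNY per million tokens, copied from the Pricing list above.
PRICE_PER_MTOK = {
    "input": Decimal("1.00"),
    "cached_input": Decimal("0.02"),
    "output": Decimal("2.00"),
}
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_cny(input_tokens: int, output_tokens: int,
                      cached_input_tokens: int = 0) -> Decimal:
    """Estimate the cost of one request in CNY.

    Assumption (not stated on the page): cached tokens are part of
    input_tokens and are charged at the cached rate only.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_tokens * PRICE_PER_MTOK["input"]
        + cached_input_tokens * PRICE_PER_MTOK["cached_input"]
        + output_tokens * PRICE_PER_MTOK["output"]
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # 200K input tokens (150K of them cached) and 10K output tokens.
    print(estimate_cost_cny(200_000, 10_000, cached_input_tokens=150_000))
```

`Decimal` is used rather than `float` so that small per-request amounts do not pick up binary rounding noise when summed over many requests.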

Similar Models

DeepSeek Chat

¥1.00/¥2.00/M

ctx 1.0M, max 384K, avail —, tps —

Compatibility alias for deepseek-v4-flash in non-thinking mode. DeepSeek will deprecate this alias in the future.

DeepSeek Reasoner

¥1.00/¥2.00/M

ctx 1.0M, max 384K, avail —, tps —

Ranked: Instruction #2

Compatibility alias for deepseek-v4-flash in thinking mode. DeepSeek will deprecate this alias in the future.

DeepSeek V4 Pro

¥3.00/¥6.00/M

ctx 1.0M, max 384K, avail —, tps —

DeepSeek V4 Pro is the higher-capability hybrid-thinking model in the V4 family. It supports both non-thinking and thinking modes, with 1M context and 384K max output.

GPT-5.6 Luna

$1.10/$6.60/M

ctx 1.1M, max 128K, avail —, tps —

GPT-5.6 Luna is optimized for cost-sensitive, high-volume workloads in the GPT-5.6 family.
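
Because the DeepSeek Chat and DeepSeek Reasoner entries above are compatibility aliases slated for deprecation, client code may want to resolve them to `deepseek-v4-flash` plus an explicit mode up front. The following is a rough sketch only: the alias IDs `deepseek-chat` and `deepseek-reasoner` are assumed from the display names (the page does not list them), `resolve_model` and `ResolvedModel` are hypothetical names, and the sketch deliberately stops short of building a request because this page does not say how the thinking mode is selected in the API.

```python
from typing import NamedTuple, Optional


class ResolvedModel(NamedTuple):
    model: str
    # True = thinking mode, False = non-thinking mode,
    # None = leave the model's own default in place.
    thinking: Optional[bool]


# Assumed alias IDs; only the display names appear on the page.
ALIASES = {
    "deepseek-chat": ResolvedModel("deepseek-v4-flash", thinking=False),
    "deepseek-reasoner": ResolvedModel("deepseek-v4-flash", thinking=True),
}


def resolve_model(model_id: str) -> ResolvedModel:
    """Map a compatibility alias to its underlying model and mode.

    IDs that are not known aliases pass through unchanged with
    thinking=None, meaning "use whatever the model defaults to".
    """
    return ALIASES.get(model_id, ResolvedModel(model_id, thinking=None))


if __name__ == "__main__":
    for model_id in ("deepseek-chat", "deepseek-reasoner", "deepseek-v4-pro"):
        print(model_id, "->", resolve_model(model_id))
```

Keeping the mapping in one table means that when the aliases are retired, only this table needs to change rather than every call site.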