DeepSeek V4 Pro is the flagship V4 model for advanced reasoning, agentic coding, tool-use, and million-token context workflows, supporting configurable thinking effort.
DeepSeek added deepseek-v4-pro and deepseek-v4-flash to its API model catalog on April 24, 2026. Both models support thinking and non-thinking modes, 1M token context, 384K max output, JSON output, tool calls, and FIM in non-thinking mode. Official pricing lists V4 Flash at $0.14 cache-miss input / $0.028 cache-hit input / $0.28 output per 1M tokens, and V4 Pro at $1.74 cache-miss input / $0.145 cache-hit input / $3.48 output per 1M tokens.
No benchmark data available.