DeepSeek

DeepSeek V4 Flash

DeepSeekSlug: deepseek-v4-flash

Models

Evaluations

Plan supports

Provider ranking

2 ranking rows

Provider Median TPS TTFT p50 Success Samples Window Rank

联通云

Product: 联通云大模型 Coding Plan

96.84

754 ms

100%

Rank #1

Ollama Cloud

Product: Ollama Cloud

83.57

997.5 ms

100%

Rank #2

Models in this family

3 models

DeepSeek V4 Flash (Non-reasoning)

9 evaluations

DeepSeek

A fast and efficient model from the DeepSeek V4 family, optimized for low-latency responses and general tasks. It excels in code generation and instruction following, but is not designed for complex reasoning or chain-of-thought tasks.

Input / 1M tokens

$0.14

Output / 1M tokens

$0.28

DeepSeek V4 Flash (Reasoning, High Effort)

9 evaluations

DeepSeek

This is the Flash version of the DeepSeek V4 series, optimized for reasoning tasks with a high-effort mode to enhance complex problem-solving. It delivers enhanced reasoning performance while maintaining fast response speeds.

Input / 1M tokens

$0.14

Output / 1M tokens

$0.28

DeepSeek V4 Flash (Reasoning, Max Effort)

9 evaluations

DeepSeek

DeepSeek V4 Flash is a fast-response model optimized for reasoning tasks. It is designed to deliver high-quality reasoning outputs with maximum computational effort while maintaining low latency.

Input / 1M tokens

$0.14

Output / 1M tokens

$0.28

Discussion

Thinking... Make sure you are connected to GitHub server