Provider ranking
2 ranking rows
Ollama Cloud
Product: Ollama CloudModels in this family
3 models
DeepSeek V4 Flash (Non-reasoning)
9 evaluationsDeepSeek
A fast and efficient model from the DeepSeek V4 family, optimized for low-latency responses and general tasks. It excels in code generation and instruction following, but is not designed for complex reasoning or chain-of-thought tasks.
Input / 1M tokens
$0.14
Output / 1M tokens
$0.28
DeepSeek V4 Flash (Reasoning, High Effort)
9 evaluationsDeepSeek
This is the Flash version of the DeepSeek V4 series, optimized for reasoning tasks with a high-effort mode to enhance complex problem-solving. It delivers enhanced reasoning performance while maintaining fast response speeds.
Input / 1M tokens
$0.14
Output / 1M tokens
$0.28
DeepSeek V4 Flash (Reasoning, Max Effort)
9 evaluationsDeepSeek
DeepSeek V4 Flash is a fast-response model optimized for reasoning tasks. It is designed to deliver high-quality reasoning outputs with maximum computational effort while maintaining low latency.
Input / 1M tokens
$0.14
Output / 1M tokens
$0.28
Discussion

Thinking... Make sure you are connected to GitHub server

