DeepSeek R1

DeepSeek

Slug: deepseek-r-1

The DeepSeek R1 series of reasoning models, covering the full-size R1 releases and their distilled variants.

Models: 8
Evaluations: 0
Plan supports: 0

Models in this family

8 models

DeepSeek R1 (Jan '25)

15 evaluations

DeepSeek

DeepSeek R1 is a reasoning-focused language model from DeepSeek, optimized for complex tasks in mathematics, coding, and logic. It uses chain-of-thought reasoning to solve problems step by step and is available as an open-weight model.

Input / 1M tokens

$1.68

Output / 1M tokens

$4.70

DeepSeek R1 0528 (May '25)

15 evaluations

DeepSeek

DeepSeek R1 0528 is the May 2025 release in the R1 series, a reasoning-focused model optimized for complex tasks that require step-by-step thinking. It targets mathematics, coding, and logical reasoning, and relies on reinforcement learning and chain-of-thought reasoning.

Input / 1M tokens

$1.35

Output / 1M tokens

$4.20

DeepSeek R1 0528 Qwen3 8B

15 evaluations

DeepSeek

A distilled reasoning model from DeepSeek, based on the Qwen3 8B architecture. It is optimized for mathematical and code reasoning tasks, offering strong performance in a lightweight and efficient package.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

DeepSeek R1 Distill Llama 70B

15 evaluations

DeepSeek

A distilled version of the DeepSeek R1 reasoning model, built on the Llama 70B architecture. It inherits strong reasoning and chain-of-thought capabilities from the R1 series while being optimized for efficiency. The model excels at complex problem-solving, code generation, and tasks requiring logical deduction.

Input / 1M tokens

$0.70

Output / 1M tokens

$1.05

DeepSeek R1 Distill Llama 8B

12 evaluations

DeepSeek

A distilled 8B parameter model from the DeepSeek R1 series, optimized for fast and efficient reasoning tasks. It inherits strong reasoning capabilities from larger R1 models while being lightweight and cost-effective.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

DeepSeek R1 Distill Qwen 1.5B

12 evaluations

DeepSeek

A distilled version of the DeepSeek R1 reasoning model, based on the Qwen 1.5B architecture. It inherits strong reasoning capabilities from the larger R1 model while being significantly smaller and faster, making it suitable for edge deployment and low-latency applications.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

DeepSeek R1 Distill Qwen 14B

12 evaluations

DeepSeek

A distilled version of the DeepSeek R1 reasoning model, built upon the Qwen 14B architecture. It aims to deliver strong reasoning and problem-solving capabilities in a more compact and efficient form factor.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00

DeepSeek R1 Distill Qwen 32B

12 evaluations

DeepSeek

A distilled version of the DeepSeek R1 reasoning model, built on the Qwen 32B architecture. It inherits strong chain-of-thought reasoning capabilities from the larger R1 model while offering faster inference speeds and lower computational costs. This model is optimized for efficient deployment without sacrificing core reasoning performance.

Input / 1M tokens

$0.00

Output / 1M tokens

$0.00
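
The per-1M-token prices listed above can be turned into a rough per-request estimate. The sketch below is only an illustration: the function name `estimate_cost` and the token counts are hypothetical, and it assumes cost scales linearly with token count at the listed input and output rates, which may not match how a given provider actually bills.

```python
from decimal import Decimal

# Per-1M-token prices in USD (input, output), copied from the listing above.
PRICES = {
    "DeepSeek R1 (Jan '25)": (Decimal("1.68"), Decimal("4.70")),
    "DeepSeek R1 0528 (May '25)": (Decimal("1.35"), Decimal("4.20")),
    "DeepSeek R1 Distill Llama 70B": (Decimal("0.70"), Decimal("1.05")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes simple linear pricing: tokens / 1M * listed price.
    """
    input_price, output_price = PRICES[model]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens and 8,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000, 8_000)}")
```

Decimal arithmetic is used instead of floats so that small per-request amounts are not distorted by binary rounding.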
