Models

DeepSeek R1 Distill Qwen 32B

A distilled version of the DeepSeek R1 reasoning model, built on the Qwen 32B architecture. It inherits strong chain-of-thought reasoning capabilities from the larger R1 model while offering faster inference speeds and lower computational costs. This model is optimized for efficient deployment without sacrificing core reasoning performance.

ReasoningFastCheap
Input / 1M tokens
$0.00
Output / 1M tokens
$0.00
Supported plans
4

Benchmark history

Evaluations

15

Lcr

Measured May 14, 2026Source

Score

0.1

Ifbench

Measured May 14, 2026Source

Score

0.23

Aime 25

Measured May 14, 2026Source

Score

0.63

Aime

Measured May 14, 2026Source

Score

0.69

Math 500

Measured May 14, 2026Source

Score

0.94

Scicode

Measured May 14, 2026Source

Score

0.38

Livecodebench

Measured May 14, 2026Source

Score

0.27

Hle

Measured May 14, 2026Source

Score

0.06

Gpqa

Measured May 14, 2026Source

Score

0.62

Mmlu Pro

Measured May 14, 2026Source

Score

0.74

Artificial Analysis Math Index

Measured May 14, 2026Source

Score

63

Artificial Analysis Intelligence Index

Measured May 14, 2026Source

Score

17.2

TAU2

Measured May 14, 2026Source

Score

0.37

Terminalbench Hard

Measured May 14, 2026Source

Score

0.16

Artificial Analysis Coding Index

Measured May 14, 2026Source

Score

24

Plan availability

Products and plans that support this model

2

Discussion

Thinking... Make sure you are connected to GitHub server