TAU2
Measured May 29, 2026Source
Score
0.94
A high-effort reasoning model from the DeepSeek V4 series, optimized for complex problem-solving and deep analytical tasks. It likely employs extended thinking or chain-of-thought processes to tackle challenging queries in coding, mathematics, and logic.
Benchmark history
Score
0.94
Score
0.42
Score
0.65
Score
0.71
Score
0.46
Score
0.34
Score
0.91
Score
43.2
Score
49.8
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server