TAU2
Measured May 29, 2026
Score: 0.37
DeepSeek V3.1 (Reasoning) is a specialized variant of the V3.1 model optimized for complex reasoning tasks. It incorporates an enhanced thinking process to deliver more accurate and logical solutions for problems requiring multi-step analysis.
Benchmark history
Score: 0.37
Score: 0.25
Score: 0.53
Score: 0.42
Score: 0.9
Score: 0.39
Score: 0.78
Score: 0.13
Score: 0.78
Score: 0.85
Score: 89.7
Score: 29.7
Score: 27.7
Plan availability