TAU2
Measured May 14, 2026
Score: 0.37
DeepSeek V3.1 (Reasoning) is a specialized variant of the V3.1 model optimized for complex reasoning tasks. It incorporates an enhanced thinking process to deliver more accurate and logical solutions for problems requiring multi-step analysis.
Benchmark history
Score: 0.37
Score: 0.25
Score: 0.53
Score: 0.42
Score: 0.9
Score: 0.39
Score: 0.78
Score: 0.13
Score: 0.78
Score: 0.85
Score: 89.7
Score: 29.7
Score: 27.7
Plan availability
