TAU2
Measured May 14, 2026
Score: 0.37
DeepSeek V3.1 Terminus (Reasoning) is a specialized variant of the V3.1 model optimized for complex reasoning tasks. It likely incorporates advanced chain-of-thought or thinking mechanisms to enhance performance on logic, analysis, and problem-solving challenges.
Benchmark history
Score: 0.37
Score: 0.3
Score: 0.65
Score: 0.57
Score: 0.9
Score: 0.41
Score: 0.8
Score: 0.15
Score: 0.79
Score: 0.85
Score: 89.7
Score: 33.7
Score: 33.9
Plan availability
