TAU2
Measured May 14, 2026Source
Score
0.11
DeepSeek R1 is a reasoning-focused language model from DeepSeek, optimized for complex tasks in mathematics, coding, and logic. It utilizes chain-of-thought reasoning to solve problems step-by-step and is available as an open-weight model.
Benchmark history
Score
0.11
Score
0.06
Score
0.52
Score
0.39
Score
0.68
Score
0.68
Score
0.97
Score
0.36
Score
0.62
Score
0.09
Score
0.71
Score
0.84
Score
68
Score
15.9
Score
18.8
Plan availability

Thinking... Make sure you are connected to GitHub server