TAU2
Measured May 29, 2026
Score: 0.11
DeepSeek R1 is a reasoning-focused language model from DeepSeek, optimized for complex tasks in mathematics, coding, and logic. It uses chain-of-thought reasoning to work through problems step by step and is available as an open-weight model.
Benchmark history
Score: 0.11
Score: 0.06
Score: 0.52
Score: 0.39
Score: 0.68
Score: 0.68
Score: 0.97
Score: 0.36
Score: 0.62
Score: 0.09
Score: 0.71
Score: 0.84
Score: 68
Score: 15.9
Score: 18.8
Plan availability