TAU2
Measured May 29, 2026
Score: 0.94
A fast and efficient model from the DeepSeek V4 family, optimized for low-latency responses and general tasks. It excels at code generation and instruction following, but is not designed for complex reasoning or chain-of-thought tasks.
Benchmark history
Score: 0.94
Score: 0.34
Score: 0.33
Score: 0.47
Score: 0.37
Score: 0.07
Score: 0.72
Score: 35.2
Score: 36.5
Plan availability