TAU2
Measured May 29, 2026Source
Score
0.96
Grok 4.20 0309 (Reasoning) is a reasoning-focused model from xAI's Grok 4 series. It is optimized for complex problem-solving, logical deduction, and tasks requiring step-by-step thinking or chain-of-thought processes.
Benchmark history
Score
0.96
Score
0.41
Score
0.59
Score
0.83
Score
0.45
Score
0.3
Score
0.89
Score
42.2
Score
48.5
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server