TAU2
Measured May 29, 2026
Score: 0.89
xAI develops advanced AI models, including the Grok series. Its flagship model, Grok 4.3, is a reasoning model designed for complex tasks, with a large context window and tool-calling capabilities.
Benchmark history
Score: 0.89
Score: 0.27
Score: 0.64
Score: 0.81
Score: 0.42
Score: 0.17
Score: 0.84
Score: 31.6
Score: 43.9