TAU2
Measured May 29, 2026Source
Score
0.65
This is a mid-tier model from the Claude 4 series, optimized for advanced reasoning and chain-of-thought processing. It balances strong performance in complex problem-solving, code generation, and multimodal understanding with efficient speed and cost.
Benchmark history
Score
0.65
Score
0.31
Score
0.65
Score
0.55
Score
0.74
Score
0.77
Score
0.99
Score
0.4
Score
0.66
Score
0.1
Score
0.78
Score
0.84
Score
74.3
Score
34.1
Score
38.7
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server