TAU2
Measured May 14, 2026Source
Score
0.65
This is a mid-tier model from the Claude 4 series, optimized for advanced reasoning and chain-of-thought processing. It balances strong performance in complex problem-solving, code generation, and multimodal understanding with efficient speed and cost.
Benchmark history
Score
0.65
Score
0.31
Score
0.65
Score
0.55
Score
0.74
Score
0.77
Score
0.99
Score
0.4
Score
0.66
Score
0.1
Score
0.78
Score
0.84
Score
74.3
Score
34.1
Score
38.7
Plan availability

Thinking... Make sure you are connected to GitHub server