TAU2
Measured May 14, 2026Source
Score
0.89
Claude Opus 4.5 (Reasoning) is Anthropic's most capable model, optimized for deep reasoning and complex problem-solving. It features an advanced 'thinking' mode that allows for extended chain-of-thought processing to tackle intricate tasks in coding, analysis, and creative work.
Benchmark history
Score
0.89
Score
0.47
Score
0.74
Score
0.58
Score
0.91
Score
0.5
Score
0.87
Score
0.28
Score
0.87
Score
0.9
Score
91.3
Score
47.8
Score
49.7
Plan availability

Thinking... Make sure you are connected to GitHub server