TAU2
Measured May 29, 2026Source
Score
0.89
Claude Opus 4.5 (Reasoning) is Anthropic's most capable model, optimized for deep reasoning and complex problem-solving. It features an advanced 'thinking' mode that allows for extended chain-of-thought processing to tackle intricate tasks in coding, analysis, and creative work.
Benchmark history
Score
0.89
Score
0.47
Score
0.74
Score
0.58
Score
0.91
Score
0.5
Score
0.87
Score
0.28
Score
0.87
Score
0.9
Score
91.3
Score
47.8
Score
49.7
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server