TAU2
Measured May 14, 2026
Score: 0.71
Claude 4.1 Opus (Reasoning) is a high-performance model from Anthropic's Claude family, specifically optimized for deep reasoning and complex problem-solving tasks. It features an extended thinking mode to tackle multi-step logical challenges and is part of the multimodal Claude series supporting long-context interactions.
Benchmark history
Score: 0.71
Score: 0.34
Score: 0.66
Score: 0.55
Score: 0.8
Score: 0.41
Score: 0.65
Score: 0.12
Score: 0.81
Score: 0.88
Score: 80.3
Score: 36.5
Score: 42
Plan availability
