TAU2
Measured May 14, 2026Source
Score
0.55
Claude 3.7 Sonnet (Reasoning) is an enhanced version of the Sonnet model optimized for complex reasoning tasks. It features an extended thinking mode for deeper, step-by-step analysis and excels at handling long-context, multimodal inputs, and coding challenges.
Benchmark history
Score
0.55
Score
0.21
Score
0.61
Score
0.48
Score
0.56
Score
0.49
Score
0.95
Score
0.4
Score
0.47
Score
0.1
Score
0.77
Score
0.84
Score
56.3
Score
27.6
Score
34.7
Plan availability

Thinking... Make sure you are connected to GitHub server