TAU2
Measured May 14, 2026Source
Score
0.84
Qwen3 Max Thinking is a high-end model from Alibaba's Qwen3 series, optimized for complex reasoning tasks. It features an enhanced thinking mode for deeper analysis and supports long-context processing.
Benchmark history
Score
0.84
Score
0.24
Score
0.66
Score
0.71
Score
0.43
Score
0.26
Score
0.86
Score
30.5
Score
39.8
Score
0.91
Score
0.94
Score
0.98
Score
0.79
Score
0.84
Score
91
Plan availability

Thinking... Make sure you are connected to GitHub server