TAU2
Measured May 14, 2026
Score: 0.74
Qwen3 Max is Alibaba Cloud's flagship large language model, designed for high-performance general-purpose tasks. It offers strong multimodal understanding and a 128K context window, and it excels at complex reasoning and code generation.
Benchmark history
Scores: 0.74, 0.2, 0.47, 0.44, 0.81, 0.38, 0.77, 0.11, 0.76, 0.84, 80.7, 26.4, 31.4, 0.94, 0.98
Plan availability
