Aime
Measured May 14, 2026Source
Score
0.23
Qwen2.5 Max is Alibaba Cloud's flagship large language model, excelling in complex reasoning, code generation, and multimodal understanding. It supports an extremely long context window and is designed for high-performance enterprise and research applications.
Benchmark history
Score
0.23
Score
0.83
Score
0.34
Score
0.36
Score
0.05
Score
0.59
Score
0.76
Score
16.3
Score
0.35
Score
0.05
Score
0.2
Score
0.37
Score
0.14
Score
14
Score
11.9
Plan availability

Thinking... Make sure you are connected to GitHub server