AIME
Measured May 14, 2026
Score: 0.11
Qwen2.5 Instruct 32B is a mid-sized, instruction-tuned language model from Alibaba's Qwen series. It is strong at instruction following, multilingual tasks, and code generation, and it maintains strong reasoning capabilities. The model supports a context window of up to 128K tokens.
Benchmark history
Score: 0.11
Score: 0.81
Score: 0.23
Score: 0.25
Score: 0.04
Score: 0.47
Score: 0.7
Score: 13.2
Score: 0.35
Score: 0.05
Score: 0.2
Score: 0.37
Score: 0.14
Score: 14
Score: 11.9