Aime
Measured May 29, 2026Source
Score
0.11
Qwen2.5 Instruct 32B is a mid-sized, instruction-tuned language model from Alibaba's Qwen series. It excels at following instructions, multilingual tasks, and code generation while maintaining strong reasoning capabilities. The model supports a long context window of up to 128K tokens.
Benchmark history
Score
0.11
Score
0.81
Score
0.23
Score
0.25
Score
0.04
Score
0.47
Score
0.7
Score
13.2
Score
0.35
Score
0.05
Score
0.2
Score
0.37
Score
0.14
Score
14
Score
11.9
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server