Aime
Measured May 14, 2026Source
Score
0.21
Qwen3 4B (Non-reasoning) is a lightweight, 4-billion parameter language model from Alibaba's Qwen3 series, optimized for fast and cost-effective inference. It is designed for general-purpose tasks and edge deployment, offering a balance of performance and efficiency without the overhead of complex reasoning chains.
Benchmark history
Score
0.21
Score
0.84
Score
0.17
Score
0.23
Score
0.04
Score
0.4
Score
0.59
Score
12.5
Score
0.84
Score
0.24
Score
0.67
Score
0.71
Score
0.91
Score
91
Score
30.5
Plan availability

Thinking... Make sure you are connected to GitHub server