TAU2
Measured May 14, 2026Source
Score
0.19
Qwen3 4B (Reasoning) is a compact 4-billion parameter model from Alibaba's Qwen3 series, optimized for reasoning tasks. It likely incorporates a chain-of-thought or thinking mode to enhance logical problem-solving while maintaining low latency and cost. This model is suitable for deployment in resource-constrained environments requiring efficient reasoning capabilities.
Benchmark history
Score
0.19
Score
0
Score
0.33
Score
0.22
Score
0.66
Score
0.93
Score
0.04
Score
0.47
Score
0.05
Score
0.52
Score
0.7
Score
22.3
Score
14.2
Score
0.24
Score
30.5
Plan availability

Thinking... Make sure you are connected to GitHub server