TAU2
Measured May 14, 2026Source
Score
0.3
Qwen3 32B (Reasoning) is a 32-billion parameter model from Alibaba's Qwen3 series, specifically optimized for complex reasoning tasks. It excels in chain-of-thought processes, logical deduction, and problem-solving, while also maintaining strong coding and long-context capabilities.
Benchmark history
Score
0.3
Score
0.03
Score
0
Score
0.36
Score
0.73
Score
0.81
Score
0.96
Score
0.35
Score
0.55
Score
0.08
Score
0.67
Score
0.8
Score
73
Score
13.8
Score
16.5
Plan availability

Thinking... Make sure you are connected to GitHub server