TAU2
Measured May 14, 2026
Score
0.35
Qwen3 14B (Reasoning) is a 14-billion-parameter model from Alibaba's Qwen3 series, optimized for complex reasoning tasks. It is designed for chain-of-thought, step-by-step logical problem-solving and aims to balance reasoning capability with computational efficiency.
Benchmark history
Score
0.35
Score
0.04
Score
0
Score
0.41
Score
0.56
Score
0.76
Score
0.96
Score
0.32
Score
0.52
Score
0.04
Score
0.6
Score
0.77
Score
55.7
Score
13.1
Score
16.2
Plan availability
