TAU2
Measured May 14, 2026
Score: 0.24
Qwen3 235B A22B (Reasoning) is a large language model from Alibaba's Qwen3 series, run in its reasoning (thinking) mode. It uses a Mixture-of-Experts (MoE) architecture with 235B total parameters, of which 22B are activated per token, trading off model capacity against inference cost. The model is aimed at instruction following and multi-step logical reasoning.
Benchmark history
Score: 0.24
Score: 0.06
Score: 0
Score: 0.39
Score: 0.82
Score: 0.84
Score: 0.93
Score: 0.4
Score: 0.62
Score: 0.12
Score: 0.7
Score: 0.83
Score: 82
Score: 17.4
Score: 19.8
Plan availability
