TAU2
Measured May 29, 2026
Score: 0.11
A large-scale reasoning model from NVIDIA's Nemotron family, built upon the Llama 3.1 architecture. It is optimized for complex, multi-step reasoning tasks and is designed to deliver high accuracy in logical inference and problem-solving.
Benchmark history
Score: 0.11
Score: 0.02
Score: 0.07
Score: 0.38
Score: 0.64
Score: 0.75
Score: 0.95
Score: 0.35
Score: 0.64
Score: 0.08
Score: 0.73
Score: 0.83
Score: 63.7
Score: 13.1
Score: 15