TAU2
Measured May 14, 2026Source
Score
0.11
A large-scale reasoning model from NVIDIA's Nemotron family, built upon the Llama 3.1 architecture. It is optimized for complex, multi-step reasoning tasks and is designed to deliver high accuracy in logical inference and problem-solving.
Benchmark history
Score
0.11
Score
0.02
Score
0.07
Score
0.38
Score
0.64
Score
0.75
Score
0.95
Score
0.35
Score
0.64
Score
0.08
Score
0.73
Score
0.83
Score
63.7
Score
13.1
Score
15
Plan availability

Thinking... Make sure you are connected to GitHub server