TAU2
Measured May 29, 2026
Score: 0.56
This is a lightweight reasoning model from OpenAI's o4 series, designed for efficient handling of complex reasoning tasks. It leverages an internal thinking mechanism to enhance performance in mathematics, coding, and scientific problems.
Benchmark history
Score: 0.56
Score: 0.15
Score: 0.55
Score: 0.69
Score: 0.91
Score: 0.94
Score: 0.99
Score: 0.47
Score: 0.86
Score: 0.18
Score: 0.78
Score: 0.83
Score: 90.7
Score: 25.6
Score: 33.1
Plan availability