TAU2
Measured May 14, 2026Source
Score
0.56
This is a lightweight reasoning model from OpenAI's o4 series, designed for efficient handling of complex reasoning tasks. It leverages an internal thinking mechanism to enhance performance in mathematics, coding, and scientific problems.
Benchmark history
Score
0.56
Score
0.15
Score
0.55
Score
0.69
Score
0.91
Score
0.94
Score
0.99
Score
0.47
Score
0.86
Score
0.18
Score
0.78
Score
0.83
Score
90.7
Score
25.6
Score
33.1
Plan availability

Thinking... Make sure you are connected to GitHub server