Aime
Measured May 29, 2026Source
Score
0.13
Allen Institute for AI
Tulu 3 405B is a large language model developed by the Allen Institute for AI (AI2), fine-tuned from Meta's Llama 3.1 405B base model. It is optimized for strong instruction-following and reasoning capabilities, aiming to be a high-performance, open-source model for complex tasks.
Benchmark history
Score
0.13
Score
0.78
Score
0.3
Score
0.29
Score
0.04
Score
0.52
Score
0.72
Score
14.1
Plan availability
Loading ratings...

Thinking... Make sure you are connected to GitHub server