Aime
Measured May 14, 2026Source
Score
0.13
Allen Institute for AI
Tulu 3 405B is a large language model developed by the Allen Institute for AI (AI2), fine-tuned from Meta's Llama 3.1 405B base model. It is optimized for strong instruction-following and reasoning capabilities, aiming to be a high-performance, open-source model for complex tasks.
Benchmark history
Score
0.13
Score
0.78
Score
0.3
Score
0.29
Score
0.04
Score
0.52
Score
0.72
Score
14.1
Plan availability

Thinking... Make sure you are connected to GitHub server