Aime
Measured May 14, 2026Source
Score
0.02
Hermes 3 is a suite of fine-tuned models based on Llama 3.1, optimized for enhanced instruction following, conversational ability, and reasoning. This 70B parameter variant offers a strong balance of capability and performance for complex dialogue and task execution.
Benchmark history
Score
0.02
Score
0.54
Score
0.23
Score
0.19
Score
0.04
Score
0.4
Score
0.57
Score
10.6
Plan availability

Thinking... Make sure you are connected to GitHub server