AIME
Measured May 29, 2026
Score
0.02
Hermes 3 is a suite of fine-tuned models based on Llama 3.1, optimized for instruction following, conversational ability, and reasoning. This 70B-parameter variant balances capability and performance for complex dialogue and task execution.
Benchmark history
Score
0.02
Score
0.54
Score
0.23
Score
0.19
Score
0.04
Score
0.4
Score
0.57
Score
10.6