TAU2
Measured May 29, 2026
Score
0.22
Hermes 4 is a 70B-parameter model from the Hermes series, fine-tuned from Llama-3.1. It is optimized for tool use and instruction following, which makes it suitable for general-purpose dialogue and task execution; it has no specialized reasoning focus.
Benchmark history
Scores: 0.22, 0, 0.02, 0.29, 0.11, 0.28, 0.27, 0.04, 0.49, 0.66, 11.3, 9.2, 12.6