TAU2
Measured May 14, 2026Source
Score
0.22
Nous Research
Hermes 4 is a 70B parameter model from the Hermes series, fine-tuned on Llama-3.1. It is optimized for strong tool use and instruction following, making it suitable for general-purpose dialogue and task execution without a specialized reasoning focus.
Benchmark history
Score
0.22
Score
0
Score
0.02
Score
0.29
Score
0.11
Score
0.28
Score
0.27
Score
0.04
Score
0.49
Score
0.66
Score
11.3
Score
9.2
Score
12.6
Plan availability

Thinking... Make sure you are connected to GitHub server