Models

Nous Research

Hermes 4 - Llama-3.1 70B (Reasoning)

Hermes 4 is a fine-tuned version of Llama-3.1 70B, specifically optimized for enhanced reasoning and chain-of-thought capabilities. It excels at complex problem-solving, logical deduction, and following intricate instructions, making it suitable for tasks requiring deep analysis. As part of the Hermes series, it maintains strong tool-use and coding proficiency.

ReasoningCoding
Input / 1M tokens
$0.13
Output / 1M tokens
$0.40
Output tokens/s
79.37
First-token seconds
0.59s
Supported plans
0

Benchmark history

Evaluations

13

TAU2

Measured May 14, 2026Source

Score

0.23

Terminalbench Hard

Measured May 14, 2026Source

Score

0.05

Lcr

Measured May 14, 2026Source

Score

0.07

Ifbench

Measured May 14, 2026Source

Score

0.31

Aime 25

Measured May 14, 2026Source

Score

0.69

Scicode

Measured May 14, 2026Source

Score

0.34

Livecodebench

Measured May 14, 2026Source

Score

0.65

Hle

Measured May 14, 2026Source

Score

0.08

Gpqa

Measured May 14, 2026Source

Score

0.7

Mmlu Pro

Measured May 14, 2026Source

Score

0.81

Artificial Analysis Math Index

Measured May 14, 2026Source

Score

68.7

Artificial Analysis Coding Index

Measured May 14, 2026Source

Score

14.4

Artificial Analysis Intelligence Index

Measured May 14, 2026Source

Score

16

Plan availability

Products and plans that support this model

0
No products or plans have been linked to this model yet.

Discussion

Thinking... Make sure you are connected to GitHub server