Home โ€บ Tools โ€บ LLM Provider / Model โ€บ Hermes 4

Hermes 4

LLM Provider / Model ยท Self-hosted assistants with steerable persona and explicit chain-of-thought support

At a glance

Input price$0.90
Output price$2.70
Cache price$0.10
Price tierMid
Context window128K
Max output16K
Context tier128K-500K
Speed tierMedium
Latencymedium
Knowledge cutoffAug 2025
ModalityText
Model IDhermes-4-405b
ProviderNous Research
HumanEval89%
MMLU85%
SWE-Bench60%
BenchmarkStrong on reasoning + alignment benchmarks
ReleasedSep 2025
HostingOpen/Self-host
CapabilitiesTool use, Hybrid reasoning, Function calling, Streaming, Steerable persona, Uncensored

What Hermes 4 does

Tool use, Hybrid reasoning, Function calling, Streaming, Steerable persona, Uncensored

Best for

Self-hosted assistants with steerable persona and explicit chain-of-thought support

Works well with

Conflicts & caveats

Build a full stack around Hermes 4 โ€” Flowpicker shows compatibility warnings before you commit.

Open the stack planner โ†’