Home โ€บ Tools โ€บ LLM Provider / Model โ€บ Nemotron 3 Super

Nemotron 3 Super

LLM Provider / Model ยท Top-tier open-weight agentic coding, 1M-context refactors, GPU-rich self-hosted deployments, NVIDIA ecosystem (NIM/NeMo), governed enterprise environments

At a glance

Input priceFree (self-hosted)
Output priceFree (self-hosted)
Price tierFree
Context window1M+
Max output32K
Context tier500K+
Speed tierFast
Latencylocal-bound
Knowledge cutoffJan 2026
ModalityText-only
Model IDnemotron-3-super-120b-a12b
ProviderNVIDIA
HumanEval89%
MMLU85%
SWE-Bench60%
BenchmarkSWE-bench Verified: top open-weight
ReleasedMar 2026
HostingOpen-weights
CapabilitiesTool use, Streaming, Structured output, Hybrid Mamba-Transformer (120B / 12B active), Agentic coding, Function calling, 2.2x throughput vs GPT-OSS-120B, Self-hostable

What Nemotron 3 Super does

Tool use, Streaming, Structured output, Hybrid Mamba-Transformer (120B / 12B active), Agentic coding, Function calling, 2.2x throughput vs GPT-OSS-120B, Self-hostable

Best for

Top-tier open-weight agentic coding, 1M-context refactors, GPU-rich self-hosted deployments, NVIDIA ecosystem (NIM/NeMo), governed enterprise environments

Works well with

Conflicts & caveats

Build a full stack around Nemotron 3 Super โ€” Flowpicker shows compatibility warnings before you commit.

Open the stack planner โ†’