Jamba Mini 2

LLM Provider / Model · Cost-efficient long-context workflows, document Q&A, summarization at 256K, enterprise on-prem deployment, 2.5x throughput vs Transformer-only

At a glance

Input price: Free (self-hosted)
Output price: Free (self-hosted)
Price tier: Free
Context window: 256K
Max output: 16K
Context tier: 128K-500K
Speed tier: Fast
Latency: local-bound
Knowledge cutoff: Oct 2025
Modality: Text-only
Model ID: jamba-mini-2-2026-01
Provider: AI21 Labs
HumanEval: 76%
MMLU: 76%
Benchmark: Arena Hard: high
Released: Jan 2026
Hosting: Open-weights
Capabilities: Tool use, Streaming, Structured output, SSM-Transformer hybrid (Mamba), 2.5x faster inference, Function calling, RULER-validated long context, Self-hostable

What Jamba Mini 2 does

Tool use, Streaming, Structured output, SSM-Transformer hybrid (Mamba), 2.5x faster inference, Function calling, RULER-validated long context, Self-hostable

Best for

Cost-efficient long-context workflows, document Q&A, summarization at 256K, enterprise on-prem deployment, 2.5x throughput vs Transformer-only
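
For the long-context uses listed above, it can help to check whether a document fits before sending it. The following is a minimal sketch, not an official AI21 utility. It assumes that "256K" and "16K" mean 256 * 1024 and 16 * 1024 tokens, and that the response shares the context window with the prompt; neither assumption is confirmed by this listing. The names `prompt_budget` and `fits` are hypothetical, and the token count must come from whatever tokenizer you actually run.

```python
# Assumption: "256K" and "16K" from the listing are read as binary multiples.
CONTEXT_WINDOW = 256 * 1024  # total tokens, prompt plus response (assumed)
MAX_OUTPUT = 16 * 1024       # largest response the model will produce


def prompt_budget(reserved_output: int = MAX_OUTPUT) -> int:
    """Return the tokens left for the prompt after reserving response space."""
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Check whether a prompt of the given token count fits the budget."""
    return 0 <= prompt_tokens <= prompt_budget(reserved_output)
```

Reserving the full 16K for output is the conservative choice; a summarization job that only needs a short answer could pass a smaller `reserved_output` to leave more room for the source document.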

Works well with

Conflicts & caveats
