Home โ€บ Tools โ€บ LLM Provider / Model โ€บ Inflection 3

Inflection 3

LLM Provider / Model ยท Enterprise deployments requiring on-prem AI with productivity tuning

At a glance

Input price$2.50
Output price$10
Cache price$0.25
Price tierMid
Context window128K
Max output16K
Context tier128K-500K
Speed tierMedium
Latencymedium
Knowledge cutoffJul 2025
ModalityText
Model IDinflection-3-productivity
ProviderInflection AI
HumanEval89%
MMLU84%
SWE-Bench58%
BenchmarkEnterprise on-prem with competitive coding scores
ReleasedOct 2025
HostingClosed/API
CapabilitiesTool use, Function calling, Streaming, Productivity-tuned, On-prem option

What Inflection 3 does

Tool use, Function calling, Streaming, Productivity-tuned, On-prem option

Best for

Enterprise deployments requiring on-prem AI with productivity tuning

Works well with

Conflicts & caveats

Build a full stack around Inflection 3 โ€” Flowpicker shows compatibility warnings before you commit.

Open the stack planner โ†’