
Step 3.5 Flash

LLM Provider / Model · High-throughput low-cost reasoning, real-time agent loops, budget-tier production deployments, fastest open-weight reasoning model in its price class

At a glance

Input price: $0.10
Output price: $0.30
Price tier: Budget
Context window: 262K
Max output: 64K
Context tier: 128K-500K
Speed tier: Fast
Latency: local-bound
Knowledge cutoff: Nov 2025
Modality: Text-only
Model ID: step-3.5-flash
Provider: StepFun
HumanEval: 82%
MMLU: 79%
Benchmark: AA Intelligence Index 38
Released: Jan 2026
Hosting: Open-weights
Capabilities: Tool use, Streaming, Structured output, Sparse MoE (196B / 11B active), Fast inference (197 tok/s), Function calling, Self-hostable
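
To make the figures above easier to apply, here is a minimal Python sketch that turns them into rough per-request estimates. It assumes the listed prices are quoted in USD per 1M tokens (the table does not state the unit) and that the 197 tok/s figure is sustained output throughput; both are assumptions, and the function names are hypothetical helpers, not part of any StepFun API.

```python
# Figures copied from the "At a glance" table.
INPUT_PRICE_USD = 0.10    # listed input price
OUTPUT_PRICE_USD = 0.30   # listed output price
TOKENS_PER_SECOND = 197   # listed inference speed
TOTAL_PARAMS_B = 196      # total parameters, in billions
ACTIVE_PARAMS_B = 11      # active parameters per token, in billions

# ASSUMPTION: prices are per 1M tokens. The table does not give the unit.
PRICE_UNIT_TOKENS = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough request cost in USD under the per-1M-token assumption."""
    total = input_tokens * INPUT_PRICE_USD + output_tokens * OUTPUT_PRICE_USD
    return total / PRICE_UNIT_TOKENS


def estimate_generation_seconds(output_tokens: int) -> float:
    """Rough decode time, ASSUMING 197 tok/s is sustained output speed."""
    return output_tokens / TOKENS_PER_SECOND


def active_fraction() -> float:
    """Share of the sparse MoE's parameters that are active per token."""
    return ACTIVE_PARAMS_B / TOTAL_PARAMS_B


if __name__ == "__main__":
    # Hypothetical agent step: 8,000 prompt tokens, 1,000 generated tokens.
    print(f"cost: ${estimate_cost_usd(8_000, 1_000):.6f}")
    print(f"decode time: {estimate_generation_seconds(1_000):.2f} s")
    print(f"active share: {active_fraction():.1%}")
```

These are back-of-envelope numbers only: real latency also depends on prompt processing, network overhead, and the hardware used when self-hosting.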

What Step 3.5 Flash does

Tool use, Streaming, Structured output, Sparse MoE (196B / 11B active), Fast inference (197 tok/s), Function calling, Self-hostable

Best for

High-throughput low-cost reasoning, real-time agent loops, budget-tier production deployments, fastest open-weight reasoning model in its price class

Works well with

Conflicts & caveats

Build a full stack around Step 3.5 Flash: Flowpicker shows compatibility warnings before you commit.
