Home โ€บ Tools โ€บ LLM Provider / Model โ€บ GLM 5 Air

GLM 5 Air

LLM Provider / Model ยท Cheap self-hosted coding agent, autocomplete, batch inference at scale

At a glance

Input price$0.15
Output price$0.60
Cache price$0.02
Price tierBudget
Context window128K
Max output16K
Context tier128K-500K
Speed tierFast
Latencylow
Knowledge cutoffAug 2025
ModalityText
Model IDglm-5-air
ProviderZhipu AI
HumanEval88%
MMLU80%
SWE-Bench58%
BenchmarkCheap-and-fast tier; strong code/agent task scores
ReleasedSep 2025
HostingOpen/Self-host
CapabilitiesTool use, Function calling, Streaming, MoE, Single-GPU friendly

What GLM 5 Air does

Tool use, Function calling, Streaming, MoE, Single-GPU friendly

Best for

Cheap self-hosted coding agent, autocomplete, batch inference at scale

Works well with

Conflicts & caveats

Build a full stack around GLM 5 Air โ€” Flowpicker shows compatibility warnings before you commit.

Open the stack planner โ†’