Claude Haiku 4.5

LLM Provider / Model · High-volume quick tasks, cost-sensitive agentic loops, inline completions

At a glance

Input price	$1 per 1M tokens
Output price	$5 per 1M tokens
Cache price (reads)	$0.10 per 1M tokens
Price tier	Budget
Context window	200K tokens
Max output	64K tokens
Context tier	128K-500K
Speed tier	Fast
Latency	Fast
Knowledge cutoff	Feb 2025
Modality	Multimodal (vision)
Model ID	claude-haiku-4-5
Provider	Anthropic
HumanEval	88%
MMLU	80%
SWE-Bench	40%
Benchmark	fast tier: strong
Released	Oct 2025
Hosting	Closed/API
Capabilities	Vision, Tool use, Streaming, Structured output, Prompt caching
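
The prices above lend themselves to a quick back-of-the-envelope estimate. The following is a minimal sketch, assuming the listed prices are in USD per one million tokens and that the cache price applies to input tokens served from the prompt cache. The function name `estimate_cost` and its parameters are illustrative, not part of any Anthropic SDK.

```python
# Prices copied from the table above.
# Assumption: all three are USD per 1,000,000 tokens.
INPUT_PER_MTOK = 1.00
OUTPUT_PER_MTOK = 5.00
CACHE_READ_PER_MTOK = 0.10


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Return an estimated request cost in USD.

    cached_tokens is the portion of input_tokens assumed to be read from
    the prompt cache and therefore billed at the cache price.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_MTOK
        + cached_tokens * CACHE_READ_PER_MTOK
        + output_tokens * OUTPUT_PER_MTOK
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 1M input tokens and 200K output tokens, with and without caching.
    print(f"no cache:   ${estimate_cost(1_000_000, 200_000):.2f}")
    print(f"80% cached: ${estimate_cost(1_000_000, 200_000, cached_tokens=800_000):.2f}")
```

Under these assumptions, caching most of a large repeated prompt reduces the input share of the bill substantially, while output tokens remain the dominant cost for generation-heavy workloads.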

OpenAI Codex (VS Code) is tied to OpenAI's GPT-5.1 Codex family — Claude Haiku 4.5 is not selectable inside this integration.
Gemini CLI is locked to Google Gemini models, so your selected Claude Haiku 4.5 cannot be used with it.
Devin uses Cognition's proprietary model internally — your selected Claude Haiku 4.5 will be ignored.
OpenAI Codex (Cloud) runs GPT-5.1 Codex internally — your selected Claude Haiku 4.5 will be ignored.
Jules runs Gemini models internally — your selected Claude Haiku 4.5 will be ignored.
Lovable uses a fixed internal model — your selected Claude Haiku 4.5 will be ignored.
Replit Agent 3 uses its own model selection internally — your selected Claude Haiku 4.5 will be ignored.
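
The warnings above amount to a lookup from tool name to the model family that tool is fixed to. The sketch below shows one way such a check could be represented. It is illustrative only and is not Flowpicker's implementation: `LOCKED_TOOLS` and `compatibility_warnings` are hypothetical names, and the table holds only the tools listed above.

```python
# Tools from the list above that do not honor an external model choice,
# mapped to what each one runs instead (wording taken from the notices).
LOCKED_TOOLS = {
    "OpenAI Codex (VS Code)": "OpenAI's GPT-5.1 Codex family",
    "Gemini CLI": "Google Gemini models",
    "Devin": "Cognition's proprietary model",
    "OpenAI Codex (Cloud)": "GPT-5.1 Codex",
    "Jules": "Gemini models",
    "Lovable": "a fixed internal model",
    "Replit Agent 3": "its own model selection",
}


def compatibility_warnings(stack: list[str], model: str = "Claude Haiku 4.5") -> list[str]:
    """Return one warning per tool in the stack that would ignore the chosen model."""
    return [
        f"{tool} uses {LOCKED_TOOLS[tool]}; your selected {model} will be ignored."
        for tool in stack
        if tool in LOCKED_TOOLS
    ]


if __name__ == "__main__":
    # "Some Other Tool" is a placeholder for a tool with no known restriction.
    for warning in compatibility_warnings(["Devin", "Some Other Tool", "Jules"]):
        print(warning)
```

Tools absent from the table produce no warning here, which only means no restriction is recorded for them, not that they are confirmed compatible.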

Build a full stack around Claude Haiku 4.5 — Flowpicker shows compatibility warnings before you commit.