Gemini 2.5 Flash pricing
Gemini 2.5 Flash is a budget-tier LLM from Google. Here's the full token-price breakdown and what it actually costs per month at real coding workloads.
Token pricing
| Input tokens | $0.30 / 1M tokens |
| Output tokens | $2.50 / 1M tokens |
| Price tier | Budget |
What it costs per month
Estimated API cost at three typical AI-coding workloads (caching off — real bills are usually lower):
| Workload | Volume | Est. cost |
|---|---|---|
| Light (hobby) | 2M in / 0.5M out | $1.9/mo |
| Daily driver | 15M in / 4M out | $14.5/mo |
| Heavy / agentic | 80M in / 20M out | $74.0/mo |
Estimates assume the listed output price. Prompt caching (where available) can cut input cost substantially on repeated context.
Cheaper alternatives
- Llama 3 (Ollama/Groq) — $0/1M input (28% SWE-bench)
- Qwen 3 (235B) — $0/1M input (53% SWE-bench)
- Qwen 3.6 — $0/1M input (57% SWE-bench)
Pair Gemini 2.5 Flash with the right tools — Flowpicker flags model/IDE compatibility before you spend.
Open the stack planner →