Vendor pricing
Google Gemini model pricing
Every Google Gemini model JP prices, side by side — input, output, and cache rates per million tokens.
| Model | Input / 1M | Output / 1M | Cache read / 1M |
|---|---|---|---|
| Gemini 3 Pro | $2.00 | $12.00 | $0.20 |
| Gemini 3 | $2.00 | $12.00 | $0.20 |
| Gemini 3 Flash | $0.50 | $3.00 | $0.05 |
| Gemini 2.5 Pro | $1.25 | $10.00 | $0.13 |
| Gemini 2.5 Flash | $0.30 | $2.50 | $0.03 |
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | $0.01 |
Notional list price. Google Gemini usage on a subscription is metered as a flat plan, so these per-token figures are list price, not spend.
How to read this table
All prices are USD per million tokens at published API list rates. Cache reads cost about a tenth of the input rate; where a model shows a >200K tier, crossing that context threshold reprices the whole request.
Subscription note
On a subscription (Gemini CLI), these list prices are notional — what usage would cost at API rates, not metered spend.
Common questions
- Which Google Gemini model is cheapest?
- The flash / mini / haiku tiers are the lowest per-token; the pro / opus tiers cost more but reason harder. The table shows the exact rates.