Glossary
Context cliff explained
The point where a request gets billed at a higher per-token tier for exceeding a context threshold.
Why it matters for cost
Above 200K (Claude/Gemini) input is ~2x and output ~1.5x for the ENTIRE request, not just the overflow.
How JP the Cat surfaces it
JP the Cat reads the real token counts behind this from the local logs on your Mac and prices them at the shipped rates — so it shows up in your actual cost, not an estimate.
Common questions
- What is context cliff?
- The point where a request gets billed at a higher per-token tier for exceeding a context threshold.
- How does it change what I pay?
- Above 200K (Claude/Gemini) input is ~2x and output ~1.5x for the ENTIRE request, not just the overflow.