Updated · Methodology: named formula library
LLM Rate Limit Budget
Calculate sustainable request rate from your tokens-per-minute (TPM) limit.
TPM Limit to Tokens/Request = 200:1 (200 as decimal).
LLM Rate Limits
TPM limit / avg tokens per request = max requests per minute. Anthropic Tier 4 (Claude): 400k TPM. OpenAI Tier 5 (GPT): 30M TPM. Plan request batching against your tier.
Worked Example
5000 TPM Limit to 1000000 Tokens/Request
- a
- 5000
- b
- 1000000
- Result
- 1:200 (0.01)
5000 / 1000000 = 0.01. Simplified: 1:200.
When to Use This Calculator
- Plan API integrations against rate limits
Limitations & Common Mistakes
- Results are estimates based on the inputs you provide.
- Always verify with current data and consult a professional for major decisions.
Frequently Asked Questions
How is the LLM Rate Limit Budget computed?
TPM Limit divided by Tokens/Request, plus a simplified ratio (e.g., 4:3) using greatest common divisor. Both decimal and ratio forms are useful in different contexts: decimal for math, ratio form for comparisons or recipe scaling.
What does TPM Limit:Tokens/Request mean?
It's a comparison: for every Tokens/Request unit, you have a corresponding amount of TPM Limit. Useful when the absolute numbers matter less than the proportion (e.g., reading 8:1 LTV/CAC immediately tells you the unit economics are healthy without needing the dollar amounts).
Why simplify the ratio?
4:3 is more readable than 200:150. The simplified form (using greatest common divisor) preserves the proportion while making it easier to interpret. Common simplified ratios: 16:9 (widescreen), 4:3 (legacy displays), 3:1 (LTV:CAC for SaaS).
When is a ratio more useful than the absolute values?
Comparison across scales. A $1B company and a $1M company can both have a 3:1 LTV:CAC; the ratio reveals comparable unit economics regardless of scale. Use ratios for benchmarking; use absolute numbers for budgeting.
Related Calculators
More AI & Technology →Claude Opus 4.7 Cost Calculator
Estimate cost of Claude Opus 4.7 API calls from token volume.
Claude Sonnet 4.6 Cost Calculator
Estimate cost of Claude Sonnet 4.6 API calls from token volume.
GPT-5 API Cost Calculator
Estimate GPT-5 API cost from token volume.
Gemini 2 Pro Cost Calculator
Estimate Gemini 2 Pro API cost from token volume.
Prompt Caching Savings
Estimate cost savings from prompt caching (90% off cached input).
Embedding Batch Cost
Estimate cost of embedding a document corpus.