Updated · Methodology: named formula library
LLM Throughput Cost
Estimate per-day cost from throughput (tokens/second).
TPS to Price/MTok = 10:3 (3 as decimal).
Throughput Cost
Tokens/sec × 86,400 sec/day × cost/MTok / 1M = daily cost. Useful for streaming chat or batch workloads. Most production teams cap at 100 TPS per key to avoid runaway bills.
Worked Example
50 TPS to 15 Price/MTok
- a
- 50
- b
- 15
- Result
- 10:3 (3.33)
50 / 15 = 3.33. Simplified: 10:3.
When to Use This Calculator
- Budget streaming AI features
Limitations & Common Mistakes
- Results are estimates based on the inputs you provide.
- Always verify with current data and consult a professional for major decisions.
Frequently Asked Questions
How is the LLM Throughput Cost computed?
TPS divided by Price/MTok, plus a simplified ratio (e.g., 4:3) using greatest common divisor. Both decimal and ratio forms are useful in different contexts: decimal for math, ratio form for comparisons or recipe scaling.
What does TPS:Price/MTok mean?
It's a comparison: for every Price/MTok unit, you have a corresponding amount of TPS. Useful when the absolute numbers matter less than the proportion (e.g., reading 8:1 LTV/CAC immediately tells you the unit economics are healthy without needing the dollar amounts).
Why simplify the ratio?
4:3 is more readable than 200:150. The simplified form (using greatest common divisor) preserves the proportion while making it easier to interpret. Common simplified ratios: 16:9 (widescreen), 4:3 (legacy displays), 3:1 (LTV:CAC for SaaS).
When is a ratio more useful than the absolute values?
Comparison across scales. A $1B company and a $1M company can both have a 3:1 LTV:CAC; the ratio reveals comparable unit economics regardless of scale. Use ratios for benchmarking; use absolute numbers for budgeting.
Related Calculators
More AI & Technology →Claude Opus 4.7 Cost Calculator
Estimate cost of Claude Opus 4.7 API calls from token volume.
Claude Sonnet 4.6 Cost Calculator
Estimate cost of Claude Sonnet 4.6 API calls from token volume.
GPT-5 API Cost Calculator
Estimate GPT-5 API cost from token volume.
Gemini 2 Pro Cost Calculator
Estimate Gemini 2 Pro API cost from token volume.
LLM Rate Limit Budget
Calculate sustainable request rate from your tokens-per-minute (TPM) limit.
Prompt Caching Savings
Estimate cost savings from prompt caching (90% off cached input).