🧮
Free Tool
AI API Cost Calculator
Estimate your monthly spend across OpenAI, Anthropic, AWS Bedrock, Google Vertex, and Azure OpenAI. Adjust your usage and compare side-by-side in real time.
No signup required · Updated April 2026 pricing
Common Questions
How accurate is this calculator?
The pricing reflects published list prices as of April 2026 for each provider. Real bills can vary due to caching credits, committed use discounts, volume tiers, and billing lag. Use this as a planning estimate, not a contract guarantee. Providers update pricing periodically — always verify against the provider's current pricing page.
Why is AWS Bedrock more expensive than direct Anthropic?
AWS Bedrock charges a markup (~10%) over Anthropic's direct API prices for hosting and infrastructure. The tradeoff is VPC access, AWS IAM integration, no data leaving your AWS region, and a single consolidated bill. For compliance-heavy use cases, the premium is worth it.
What counts as an "input token" vs "output token"?
Input tokens are everything in the prompt — system message, user message, conversation history, retrieved context. Output tokens are what the model generates in response. Output tokens cost significantly more across all providers because they require more compute. A typical RAG application might have 800 input tokens and 300 output tokens per call.
GPT-4o mini vs Claude Haiku — which is cheaper?
It depends on your input/output ratio. GPT-4o mini is cheaper on input ($0.15 vs $0.25 per 1M). Claude Haiku is cheaper on output ($1.25 vs $0.60 per 1M). For output-heavy workloads (long generations, detailed reports), GPT-4o mini wins. For input-heavy workloads (large context windows, document processing), Claude Haiku wins. Use this calculator with your actual token split to find out.
Does Azure OpenAI cost the same as OpenAI direct?
At list prices, yes — Azure OpenAI mirrors OpenAI's pricing for the same models. The difference is deployment model (your own Azure resource vs shared OpenAI infrastructure), data residency, SLA guarantees, and enterprise support. Azure also has reserved capacity and provisioned throughput pricing that can be significantly cheaper at scale.
How do I track actual AI spend instead of estimating?
PayMesh connects to your provider accounts via API keys and pulls real usage data, normalizing it across all providers into a single dashboard. You can set budget alerts, see per-agent or per-team breakdowns, and get alerted before a bill surprises you.
Start free here.