AWS Bedrock Token Cost Calculator
See how much you'll spend on LLM inference — pick a model, enter your token volume, get monthly costs in seconds.
Which Bedrock model are you using?
Select the model you're running inference on. Prices are quoted per million tokens (input and output rates usually differ).
What's your daily inference volume?
Enter your daily API calls and average token counts per request.
Prompt size: 100 tokens = ~75 words
Completion size: 100 tokens = ~75 words
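The ~75-words-per-100-tokens heuristic above can be turned into a quick estimator. A minimal sketch (the function name and the 0.75 words-per-token ratio are illustrative, not part of the calculator):

```python
def words_to_tokens(word_count: int) -> int:
    """Rough token estimate using the ~0.75 words-per-token heuristic."""
    return round(word_count / 0.75)

words_to_tokens(75)   # ≈ 100 tokens
words_to_tokens(750)  # ≈ 1,000 tokens
```

Actual token counts vary by tokenizer and language, so treat this as a planning estimate, not an exact figure.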
Your Estimated Monthly Cost
Based on your token volume and selected model
Monthly Bedrock Cost
Cost Breakdown
Cost per 1M input tokens: $3.00
Cost per 1M output tokens: $15.00
Daily cost: ~$X.XX
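The estimate itself is simple arithmetic. A sketch of the calculation, assuming the illustrative $3.00 / $15.00 per-1M-token rates shown in the breakdown and a 30-day month (swap in your model's actual rates):

```python
def monthly_bedrock_cost(
    daily_requests: int,
    input_tokens_per_request: int,
    output_tokens_per_request: int,
    input_price_per_1m: float = 3.00,    # illustrative rate from the breakdown
    output_price_per_1m: float = 15.00,  # illustrative rate from the breakdown
    days_per_month: int = 30,
) -> float:
    """Estimate monthly Bedrock spend from daily token volume."""
    daily_input = daily_requests * input_tokens_per_request
    daily_output = daily_requests * output_tokens_per_request
    daily_cost = (daily_input / 1_000_000) * input_price_per_1m \
               + (daily_output / 1_000_000) * output_price_per_1m
    return daily_cost * days_per_month

# Example: 10,000 requests/day, 500 input + 200 output tokens each
print(f"${monthly_bedrock_cost(10_000, 500, 200):,.2f}/month")  # $1,350.00/month
```

At that volume the daily cost is $15 of input plus $30 of output, so $45/day, or $1,350/month.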
Cost Optimization Tips
- Batch requests: Combining multiple items into one prompt amortizes shared context, lowering the effective cost per result
- Model selection matters: Lightweight models like Nova Lite can be orders of magnitude cheaper per token than Claude Opus
- Cache prompts: Bedrock prompt caching reduces input costs 90% for repeated context
- Monitor usage: Set up AWS Budgets alerts and CloudWatch alarms to catch cost spikes early
See per-token costs for your model and strategies to reduce spend — free.
Full breakdown on its way!
Check your inbox for cost optimization strategies.
Detailed Cost Analysis
Ready to Scale Bedrock in Production?
FactualMinds helps enterprises optimize Bedrock costs, implement prompt caching, and ensure safe LLM guardrails in production.
Explore Bedrock Services →
Who This Tool Is For
Engineering teams, data scientists, and product managers evaluating Bedrock for production workloads. If you're building with Claude, Nova, or Llama on Bedrock and need to forecast costs, this tool gives you quick, accurate estimates.
Why We Built This Tool
Bedrock pricing can be opaque when you're doing capacity planning. This calculator lets you plug in your token volume and get an instant monthly cost estimate. No algebra, no spreadsheets.
What Problem It Solves
- Cost forecasting. Know upfront what your Bedrock bill will be based on token volume and model choice.
- Model comparison. See how much cheaper Nova is than Claude, or how much you save by switching to Haiku.
- Budget planning. Allocate costs accurately without over- or under-estimating LLM spend.
- Right-sizing decisions. Understand the trade-off between model capability and cost per token.
Learn more about our AWS Bedrock consulting services to optimize your inference architecture.
Frequently Asked Questions
Does Bedrock have a free tier?
Bedrock has no free tier; all inference is pay-per-token. New accounts can apply general AWS promotional credits or AWS Activate credits toward the bill, but once those run out, you pay per token.
What about context window tokens?
This calculator assumes each model's standard context window (e.g., 200K tokens for Claude). Context tokens are billed at the same per-token input rate, so a larger prompt simply means more input tokens — though very large prompts may need chunking or summarization to fit within the window.
Does prompt caching save money?
Yes. Cached input tokens cost 90% less than regular input tokens. If you repeat the same context (system prompts, documents), caching dramatically reduces costs.
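Back-of-the-envelope, with a 90% discount on cached input tokens as described above (the function and rates are illustrative):

```python
def input_cost_with_caching(
    total_input_tokens: int,
    cached_fraction: float,
    price_per_1m: float = 3.00,   # illustrative input rate
    cache_discount: float = 0.90, # cached tokens cost 90% less
) -> float:
    """Monthly input-token cost when a fraction of tokens hit the prompt cache."""
    cached = total_input_tokens * cached_fraction
    uncached = total_input_tokens - cached
    return (uncached * price_per_1m
            + cached * price_per_1m * (1 - cache_discount)) / 1_000_000

# 100M input tokens/month, 80% repeated context (system prompt, documents)
print(round(input_cost_with_caching(100_000_000, 0.8), 2))  # ~$84 vs $300 uncached
```

In this example, caching 80% of input tokens cuts the input bill from $300 to about $84 per month.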
What about batch processing?
Bedrock's batch inference runs at roughly a 50% discount to on-demand pricing. Use it for non-real-time workloads where latency isn't critical.
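To see what that discount means in dollars, here is a sketch comparing on-demand and batch cost for the same token volume (function name, rates, and the flat 50% discount are illustrative assumptions):

```python
def batch_vs_ondemand(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1m: float = 3.00,    # illustrative rates
    output_price_per_1m: float = 15.00,
    batch_discount: float = 0.50,        # assumed flat batch discount
) -> tuple[float, float]:
    """Return (on_demand_cost, batch_cost) for the same token volume."""
    on_demand = (input_tokens * input_price_per_1m
                 + output_tokens * output_price_per_1m) / 1_000_000
    batch = on_demand * (1 - batch_discount)
    return on_demand, batch

# 50M input + 10M output tokens: $300 on-demand vs $150 via batch
on_demand, batch = batch_vs_ondemand(50_000_000, 10_000_000)
```

The trade-off is latency: batch jobs are queued and processed asynchronously, so reserve them for offline workloads like evaluation runs or bulk document processing.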
