How does AI API pricing work?

AI APIs charge per token — roughly 4 characters or 0.75 words. Most models bill input (your prompt) and output (the response) at different rates. Enter your expected token counts and call volume to see your real monthly cost.

Which AI API is cheapest?

DeepSeek V4 Flash and Mistral Small-4 are currently the most affordable at under $0.20/1M input tokens. For high-quality outputs, GPT-4o mini and Claude Haiku offer the best quality-to-cost ratio.

How many tokens is 1,000 words?

Approximately 1,333 tokens. Rule of thumb: 1 token ≈ 0.75 words. Code tokenizes differently — usually more tokens per line due to whitespace and syntax.

What is the difference between input and output tokens?

Input tokens are your prompt — system instructions, user messages, conversation history. Output tokens are the model's response. Output is typically 3–6× more expensive per token than input.

AI API Pricing Calculator — Compare OpenAI, Anthropic, Google & More

Input tokens per call (your prompt)

Output tokens per call (model response)

Calls per day total API calls per day

Preset:

Cost Comparison Sorted by monthly cost · lowest first

Provider	Model	Input / 1M	Output / 1M	Per call	Daily	Monthly ▾

💡

Token Counter Free

Paste any text — a system prompt, a user message, a doc page. This tells you how many tokens it uses, so you can plug the exact number into the calculator above.

Quick reference

~50–150 — short user message

~200–500 — system prompt

~600–800 — one page of text

~1,333 — per 1,000 words

0 Tokens

0 Words

0 Chars

Full Reference

All Model Prices at a Glance

Flat list of every provider and model. Prices per 1M tokens unless noted.

Provider	Model	Input / 1M	Output / 1M	Best for
OpenAI	gpt-5.5	$5.00	$30.00	Current flagship, released Apr 2026
OpenAI	gpt-5.4	$2.50	$15.00	General purpose, vision, multimodal
OpenAI	gpt-5.4-mini	$0.75	$4.50	High-volume tasks, fast responses
OpenAI	gpt-5.4-nano	$0.20	$1.25	Ultra-budget, simple tasks
Anthropic	claude-opus-4.7	$5.00	$25.00	Complex reasoning, long documents
Anthropic	claude-sonnet-4.6	$3.00	$15.00	Balanced quality and speed
Anthropic	claude-haiku-4.5	$1.00	$5.00	Fast, lightweight tasks
Google	gemini-3.5-flash	$1.50	$9.00	Latest Flash, launched May 2026
Google	gemini-3-flash-preview	$0.50	$3.00	High-volume, low-latency
Google	gemini-3.1-pro-preview	$2.00	$12.00	Advanced reasoning, code
Mistral	mistral-large-3	$0.50	$1.50	Flagship quality, surprisingly cheap
Mistral	mistral-medium-3.5	$1.50	$7.50	European compliance, multilingual
Mistral	mistral-small-4	$0.15	$0.60	Cheapest capable model
xAI	grok-4.3	$1.25	$2.50	Low output cost, real-time data
xAI	grok-4.20	$2.00	$6.00	Mid-tier, current active SKU
DeepSeek	deepseek-v4-flash	$0.14	$0.28	Cheapest overall, surprisingly capable
DeepSeek	deepseek-v4-pro	$1.74	$3.48	High quality at low cost

FAQ

Common Questions

APIs charge per token — roughly 4 characters or 0.75 words. Your prompt (input) and the model's response (output) are billed at separate rates. Output is almost always more expensive. Enter your expected token counts and calls per day above to see exactly what you'd pay.

DeepSeek V4 Flash ($0.14/$0.28 per 1M tokens) and Mistral Small-4 ($0.15/$0.60) are the most affordable. For production workloads where quality matters, GPT-5.4 nano and Gemini Flash are strong budget picks. Use the calculator above to compare your actual use case — rankings shift depending on your input/output ratio.

Approximately 1,333 tokens. The rule of thumb: 1 token ≈ 0.75 words. A typical system prompt is 200–500 tokens. A full document page is ~600–800 tokens. Paste your text into the token counter on the right for a precise estimate.

Generating tokens requires significantly more compute than reading them. The model processes your input in parallel, but generates output one token at a time. This is why output pricing is typically 3–10× higher. Tighter, more structured prompts that reduce output length are the fastest way to cut API costs.

Yes — the Batch API gives 50% off on most OpenAI models for requests that can tolerate up to 24-hour turnaround. Ideal for data processing, content generation, or any non-real-time task. For very high volumes, custom enterprise pricing is available directly from OpenAI.

It depends on your use case. OpenAI GPT-5.4 and Anthropic Claude Sonnet are the most reliable general-purpose choices with strong tool-use and instruction-following. DeepSeek and Mistral are dramatically cheaper and worth testing for structured or lower-stakes tasks. Use this calculator to find out exactly how much you'd save — then run a quality test on your actual prompts before committing.

AI API Pricing Calculator

All Model Prices at a Glance

Common Questions

Building something with AI?