🤖 AI Model Cost Calculator

Compare API pricing across 100+ AI models from OpenAI, Anthropic, Google, Meta and more. Enter your expected token usage to estimate your costs. Pricing data is fetched live from OpenRouter and cached for 24 hours.

ℹ️ For reference only. Prices are sourced from OpenRouter and may differ from what you are billed directly by each provider. Always check the provider's official pricing page before budgeting production workloads.

About This Calculator

The AI Model Cost Calculator fetches live pricing data from OpenRouter, an API aggregator that provides access to 100+ large language models (LLMs) from providers including OpenAI, Anthropic, Google DeepMind, Meta, Mistral, Cohere and others. Select any model, enter your expected token usage, and instantly see the cost breakdown.

How AI API Pricing Works

AI language model APIs charge separately for two types of tokens:

  • Input tokens — the text you send to the model (your prompt, context, chat history, documents). These are cheaper because the model processes them in one pass.
  • Output tokens — the text the model generates in response. These are more expensive because generation is computed token by token.

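Prices are quoted per 1 million (1M) tokens. To estimate your cost, multiply your token count by the per-1M price and divide by 1,000,000 (equivalently, multiply by the per-token price), then add the input and output amounts together. This calculator does that automatically for any model on OpenRouter.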
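The arithmetic above can be sketched in a few lines of Python. This is a minimal illustration, not the calculator's actual code: the function name `estimate_cost` and its parameters are hypothetical, and prices are passed as strings so that `Decimal` keeps small fractions of a cent exact.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return (input_cost, output_cost, total_cost) in USD.

    Prices are USD per 1M tokens, given as strings such as "2.50".
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m) / TOKENS_PER_MILLION
    return input_cost, output_cost, input_cost + output_cost

# Example with made-up prices: 1,000 input tokens at $2.50/1M
# and 500 output tokens at $10.00/1M.
input_cost, output_cost, total = estimate_cost(1_000, 500, "2.50", "10.00")
print(f"input ${input_cost}, output ${output_cost}, total ${total}")
```

With these illustrative numbers the input side comes to $0.0025 and the output side to $0.005, so even though fewer tokens are generated than sent, the output accounts for most of the bill.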

What Is a Token?

A token is roughly 4 characters or ¾ of a word in English. As a rough guide:

  • A short sentence (~10 words) ≈ 13–15 tokens
  • A paragraph (~100 words) ≈ 130–150 tokens
  • A full page of text (~500 words) ≈ 650–750 tokens
  • A 10-page document (~5,000 words) ≈ 6,500–7,500 tokens
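The rules of thumb above can be turned into a quick estimator. The sketch below is only a heuristic for English text, and the name `estimate_tokens` is hypothetical. Real tokenizers differ from model to model, so use the provider's own tokenizer when you need an exact count.

```python
def estimate_tokens(text: str) -> tuple[int, int]:
    """Return two rough token estimates for English text: (by_chars, by_words)."""
    by_chars = round(len(text) / 4)              # about 4 characters per token
    by_words = round(len(text.split()) / 0.75)   # about 3/4 of a word per token
    return by_chars, by_words

sample = "Compare API pricing across many AI models before you commit to one."
by_chars, by_words = estimate_tokens(sample)
print(f"~{by_chars} tokens by characters, ~{by_words} tokens by words")
```

The two estimates rarely agree exactly. Treating them as a range is usually more honest than quoting a single number.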

Frequently Asked Questions

How is AI model cost calculated? Total cost = (Input Tokens × input price per token) + (Output Tokens × output price per token). Providers quote prices per 1 million tokens. For example, if input costs $2.50/1M and you send 1,000 tokens, the input cost is $0.0025.

What is a token in AI models? A token is a unit of text — roughly 4 characters or ¾ of a word in English. A typical 100-word paragraph is approximately 130–150 tokens. Non-English languages often use more tokens per word.

Why are input and output tokens priced differently? Input tokens are cheaper because the model reads them in a single forward pass. Output tokens require sequential autoregressive generation — each token is computed one at a time — which is significantly more compute-intensive.

Where does the pricing data come from? Pricing is fetched live from the OpenRouter API, which aggregates 100+ models. Data is cached in your browser for 24 hours. OpenRouter's prices may differ slightly from direct provider pricing.
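A fetch-and-cache flow like the one described above could look roughly like this in Python. It is a sketch under stated assumptions, not this calculator's implementation, which caches in the browser. The endpoint URL and the response shape (a `data` list whose entries carry an `id` and a `pricing` object with per-token `prompt` and `completion` prices as strings) are assumptions to check against OpenRouter's API documentation. The helper names and the in-memory cache are hypothetical.

```python
import json
import time
import urllib.request
from decimal import Decimal, InvalidOperation

# Assumed endpoint; verify against OpenRouter's API documentation.
MODELS_URL = "https://openrouter.ai/api/v1/models"
CACHE_TTL_SECONDS = 24 * 60 * 60  # 24 hours

_cache = {"fetched_at": None, "prices": None}

def fetch_models_json(url=MODELS_URL):
    """Download the raw model list (performs a network request)."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)

def parse_prices(payload):
    """Map model id -> (input, output) price in USD per 1M tokens.

    Assumes entries shaped like
    {"id": "...", "pricing": {"prompt": "<USD per token>", "completion": "..."}}.
    """
    prices = {}
    for model in payload.get("data", []):
        try:
            pricing = model["pricing"]
            prices[model["id"]] = (
                Decimal(pricing["prompt"]) * 1_000_000,
                Decimal(pricing["completion"]) * 1_000_000,
            )
        except (KeyError, TypeError, InvalidOperation):
            continue  # skip entries without usable pricing
    return prices

def get_prices(fetch=fetch_models_json, now=time.time):
    """Return cached prices, refetching once the cache is 24 hours old."""
    fetched_at = _cache["fetched_at"]
    if fetched_at is None or now() - fetched_at >= CACHE_TTL_SECONDS:
        _cache["prices"] = parse_prices(fetch())
        _cache["fetched_at"] = now()
    return _cache["prices"]
```

Passing `fetch` and `now` as parameters keeps the cache logic testable without touching the network or waiting a day.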

How many tokens does a typical ChatGPT conversation use? A short query and response might use 200–400 tokens total. A detailed technical exchange with a long reply can use 2,000–5,000 tokens or more, depending on how much earlier conversation is included as context.
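To see why longer conversations grow quickly, the sketch below totals billed tokens over several exchanges. It assumes a stateless chat API where every request resends the full prior history as input, which is common but not universal. The function name and the per-turn token counts are hypothetical.

```python
def conversation_tokens(turns):
    """Total billed tokens for a chat, assuming full history is resent each turn.

    turns: list of (user_tokens, reply_tokens) pairs, one per exchange.
    Returns (billed_input_tokens, billed_output_tokens).
    """
    history = 0
    billed_input = 0
    billed_output = 0
    for user_tokens, reply_tokens in turns:
        billed_input += history + user_tokens  # prior history plus the new message
        billed_output += reply_tokens
        history += user_tokens + reply_tokens  # both sides join the history
    return billed_input, billed_output

# Three exchanges of a 100-token question and a 200-token answer.
billed_input, billed_output = conversation_tokens([(100, 200)] * 3)
print(billed_input, billed_output)
```

Under this assumption, input tokens grow faster than the number of turns because each request carries everything that came before it.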

⚠️ Important Note: This tool is intended to provide estimates and should not be used as a substitute for professional advice. Pricing shown is sourced from OpenRouter and may not reflect direct provider pricing, volume discounts, or enterprise agreements. Always verify costs with your provider before budgeting production workloads.

Related Calculators

  • AI vs Human ROI Calculator — Compare AI tool costs against human labour. Monthly savings, annual ROI and break-even with live model pricing.
  • Prompt Cost Calculator — Paste your prompt, choose a model, and instantly see how many tokens it uses and what it costs to send.
  • LLM Cost Comparator — Side-by-side per-million-token cost for GPT-5, Claude Opus 4.7, Gemini 3.1 Pro, Llama, DeepSeek.