Open Calculator →
📊 Updated May 2026

LLM API Pricing — Every Model, Side by Side

30+ models from 8 providers. Sort by price, filter by tier. Find the cheapest LLM API for your workload in under 60 seconds.

Notified when any model changes price. No spam.
✓ You're on the list
8
Providers tracked
35+
Models listed
$0.02
Cheapest (per 1M tokens)
2M
Max context window
All
Budget
Mid-tier
Standard
Premium
Reasoning
Click column headers to sort ↕
Provider Model Input $/1M Output $/1M Context Tier Best For Compare

Prices sourced directly from provider API documentation. Updated May 2026. See changelog →

Deep-Dive Comparisons

GPT-4o vs Claude Sonnet
Head-to-head on quality, price, and use cases. The two most popular production APIs compared.
Read comparison →
Cheapest AI Models Ranked
33 models ranked by price. Find the lowest-cost API for your workload with full cost analysis.
See rankings →
GPT-4o Mini vs Claude Haiku
The two dominant budget models. $0.15/M vs $0.80/M — a 5x cost gap with real quality tradeoffs.
Read comparison →
Claude Sonnet vs Gemini 2.5 Pro
Premium-tier battle. Claude at $3.00/M vs Gemini at $1.25/M — when does the price premium pay off?
Read comparison →
Google Gemini vs Anthropic Claude
Full model family comparison. 5 Gemini models vs 4 Claude models with benchmark scores.
Read comparison →
Open Source vs Proprietary LLMs
Llama, Mistral, DeepSeek vs GPT, Claude, Gemini. Self-hosting TCO break-even analysis included.
Read comparison →

Frequently Asked Questions

Which LLM API is cheapest in 2026? +
Mistral Nemo is the cheapest at $0.02/M input tokens — but it's a lightweight model for simple tasks. Among capable production models: Gemini 2.5 Flash-Lite and GPT-4.1 Nano both hit $0.10/M input. DeepSeek V3.2 delivers GPT-4-level quality at $0.14/M. For reasoning tasks, DeepSeek R1-Distill provides state-of-the-art chain-of-thought at $0.14/M — roughly 35x cheaper than OpenAI o3. See our full cheapest models ranking for the complete list.
How much does the OpenAI API cost per 1M tokens in 2026? +
OpenAI API prices span a wide range. GPT-4.1 Nano is cheapest at $0.10/M input, $0.40/M output. GPT-4o Mini costs $0.15/M input, $0.60/M output. GPT-5 Mini is $0.30/M input, $1.20/M output. GPT-5.4 (the flagship) costs $2.50/M input, $15.00/M output. The premium o3-mini reasoning model costs $5.00/M input and $20.00/M output.
Is Claude API cheaper than GPT in 2026? +
At the budget tier, no — Claude Haiku 4.5 ($0.80/M input) is significantly more expensive than GPT-4o Mini ($0.15/M) or GPT-4.1 Nano ($0.10/M). At the standard tier, they're comparable: Claude Sonnet 4.6 costs $3.00/M vs GPT-5.4 at $2.50/M. At the premium tier, Claude Opus 4.6 ($5.00/M) is cheaper than GPT-5.4 Pro ($30.00/M). Anthropic competes on quality and reliability, not price. Compare them directly at our GPT-4o vs Claude comparison.
What LLM has the largest context window? +
Gemini 3.1 Pro and several xAI Grok models (Grok 4.1 Fast, Grok 4.2 Fast, Grok 4 Fast) offer the largest context windows at 2M tokens. Google Gemini 2.5 Flash, Gemini 2.5 Flash-Lite, and Gemini 2.5 Pro all support 1M tokens. Claude Opus 4.6 and Claude Sonnet 4.6 support 1M tokens. DeepSeek V4 also supports 1M tokens at an aggressive price point of $0.30/M input.
Which LLM API is best for production apps in 2026? +
The best choice depends on your volume and quality requirements. For high-volume / cost-sensitive workloads: GPT-4.1 Nano ($0.10/M) or Gemini 2.5 Flash-Lite ($0.10/M). For balanced quality + cost: Gemini 2.5 Flash ($0.30/M), GPT-5 Mini ($0.30/M), or DeepSeek V4 ($0.30/M). For production-quality with reliability guarantees: Claude Sonnet 4.6 ($3.00/M) or GPT-5.4 ($2.50/M). For reasoning / math / code: DeepSeek R1 ($0.55/M) is a standout value. Use our interactive calculator to model your specific token volumes.
How do I calculate my monthly LLM API cost? +
Monthly cost = (daily input tokens × input price/token + daily output tokens × output price/token) × 30. Since prices are per 1M tokens, divide by 1,000,000. Example: 1M input + 200K output tokens per day with GPT-5 Mini ($0.30/M input, $1.20/M output) = ($0.30 × 1 + $1.20 × 0.2) × 30 = ($0.30 + $0.24) × 30 = $16.20/month. Use our free AI cost calculator to model any volume and compare providers side-by-side automatically.

📬 Get weekly LLM pricing updates

AI model prices change frequently — OpenAI, Anthropic, and Google all updated pricing multiple times in 2025. Get notified the moment any model changes price.