# TokenMonopoly > Live leaderboard of AI API deals, pricing, and SWE-bench scores. Compare the cheapest Claude, GPT, Gemini, Kimi, DeepSeek, Llama and Qwen hosts across 96+ providers. Refreshed daily. Last generated: 2026-04-12T03:58:37.252Z Models tracked: 27 Hosts tracked: 96 ## Core content - [Homepage (interactive leaderboard)](https://tokenmonopoly.com/) - [Full leaderboard in Markdown](https://tokenmonopoly.com/index.md) - [Cheapest AI APIs](https://tokenmonopoly.com/cheapest) - [AI Deals (subscriptions, flat-rate plans, harnesses)](https://tokenmonopoly.com/ai-deals) - [About](https://tokenmonopoly.com/about) Per-model landing pages: `/{modifier}/{model}` where modifier ∈ {`cheapest`, `best`, `most-reliable`}. Examples: - https://tokenmonopoly.com/cheapest/claude-sonnet-4-6 - https://tokenmonopoly.com/cheapest/kimi-k2-5 - https://tokenmonopoly.com/cheapest/llama-3-3-70b-instruct - https://tokenmonopoly.com/best/gpt-5-4 - https://tokenmonopoly.com/most-reliable/deepseek-v3-2 ## Top 15 cheapest models (by lowest input $/MTok) 1. **GPT-5.4 Nano** — $0.20/MTok input at **Azure** · 2 hosts · 69.8% SWE-bench · [https://tokenmonopoly.com/cheapest/gpt-5-4-nano](https://tokenmonopoly.com/cheapest/gpt-5-4-nano) 2. **Gemini 3.1 Flash Lite Preview** — $0.25/MTok input at **Google AI Studio** · 2 hosts · 62.8% SWE-bench · [https://tokenmonopoly.com/cheapest/gemini-3-1-flash-lite-preview](https://tokenmonopoly.com/cheapest/gemini-3-1-flash-lite-preview) 3. **GPT-5 Mini** — $0.25/MTok input at **OpenAI** · 2 hosts · 60.8% SWE-bench · [https://tokenmonopoly.com/cheapest/gpt-5-mini](https://tokenmonopoly.com/cheapest/gpt-5-mini) 4. **MiniMax M2.1** — $0.29/MTok input at **AtlasCloud** · 6 hosts · 74.8% SWE-bench · [https://tokenmonopoly.com/cheapest/minimax-m2-1](https://tokenmonopoly.com/cheapest/minimax-m2-1) 5. **MiniMax M2.7** — $0.30/MTok input at **Minimax** · 1 host · 73.8% SWE-bench · [https://tokenmonopoly.com/cheapest/minimax-m2-7](https://tokenmonopoly.com/cheapest/minimax-m2-7) 6. **Qwen3.6 Plus** — $0.33/MTok input at **Alibaba** · 1 host · 73.4% SWE-bench · [https://tokenmonopoly.com/cheapest/qwen3-6-plus](https://tokenmonopoly.com/cheapest/qwen3-6-plus) 7. **Kimi K2.5** — $0.38/MTok input at **Chutes** · 15 hosts · 70.0% SWE-bench · [https://tokenmonopoly.com/cheapest/kimi-k2-5](https://tokenmonopoly.com/cheapest/kimi-k2-5) 8. **GLM 4.7** — $0.39/MTok input at **Chutes** · 11 hosts · 69.4% SWE-bench · [https://tokenmonopoly.com/cheapest/glm-4-7](https://tokenmonopoly.com/cheapest/glm-4-7) 9. **Devstral 2 2512** — $0.40/MTok input at **Mistral** · 1 host · 62.8% SWE-bench · [https://tokenmonopoly.com/cheapest/devstral-2512](https://tokenmonopoly.com/cheapest/devstral-2512) 10. **Gemini 3 Flash Preview** — $0.50/MTok input at **Google AI Studio** · 2 hosts · 75.0% SWE-bench · [https://tokenmonopoly.com/cheapest/gemini-3-flash-preview](https://tokenmonopoly.com/cheapest/gemini-3-flash-preview) 11. **Kimi K2 0711** — $0.57/MTok input at **Novita** · 1 host · 60.2% SWE-bench · [https://tokenmonopoly.com/cheapest/kimi-k2](https://tokenmonopoly.com/cheapest/kimi-k2) 12. **GLM 5** — $0.72/MTok input at **GMICloud** · 16 hosts · 71.4% SWE-bench · [https://tokenmonopoly.com/cheapest/glm-5](https://tokenmonopoly.com/cheapest/glm-5) 13. **GPT-5.4 Mini** — $0.75/MTok input at **Azure** · 2 hosts · 73.0% SWE-bench · [https://tokenmonopoly.com/cheapest/gpt-5-4-mini](https://tokenmonopoly.com/cheapest/gpt-5-4-mini) 14. **GLM 5.1** — $0.95/MTok input at **Chutes** · 13 hosts · 76.4% SWE-bench · [https://tokenmonopoly.com/cheapest/glm-5-1](https://tokenmonopoly.com/cheapest/glm-5-1) 15. **Claude Haiku 4.5** — $1.00/MTok input at **Amazon Bedrock** · 3 hosts · 66.6% SWE-bench · [https://tokenmonopoly.com/cheapest/claude-haiku-4-5](https://tokenmonopoly.com/cheapest/claude-haiku-4-5) ## Top 10 best value models (SWE-bench per dollar) 1. **MiniMax M2.1** — 74.8% SWE-bench at $0.40/MTok blended via **AtlasCloud** · [https://tokenmonopoly.com/best/minimax-m2-1](https://tokenmonopoly.com/best/minimax-m2-1) 2. **GPT-5.4 Nano** — 69.8% SWE-bench at $0.38/MTok blended via **Azure** · [https://tokenmonopoly.com/best/gpt-5-4-nano](https://tokenmonopoly.com/best/gpt-5-4-nano) 3. **MiniMax M2.7** — 73.8% SWE-bench at $0.45/MTok blended via **Minimax** · [https://tokenmonopoly.com/best/minimax-m2-7](https://tokenmonopoly.com/best/minimax-m2-7) 4. **Gemini 3.1 Flash Lite Preview** — 62.8% SWE-bench at $0.46/MTok blended via **Google AI Studio** · [https://tokenmonopoly.com/best/gemini-3-1-flash-lite-preview](https://tokenmonopoly.com/best/gemini-3-1-flash-lite-preview) 5. **Qwen3.6 Plus** — 73.4% SWE-bench at $0.60/MTok blended via **Alibaba** · [https://tokenmonopoly.com/best/qwen3-6-plus](https://tokenmonopoly.com/best/qwen3-6-plus) 6. **Kimi K2.5** — 70.0% SWE-bench at $0.61/MTok blended via **Chutes** · [https://tokenmonopoly.com/best/kimi-k2-5](https://tokenmonopoly.com/best/kimi-k2-5) 7. **GLM 4.7** — 69.4% SWE-bench at $0.62/MTok blended via **Chutes** · [https://tokenmonopoly.com/best/glm-4-7](https://tokenmonopoly.com/best/glm-4-7) 8. **GPT-5 Mini** — 60.8% SWE-bench at $0.54/MTok blended via **OpenAI** · [https://tokenmonopoly.com/best/gpt-5-mini](https://tokenmonopoly.com/best/gpt-5-mini) 9. **Devstral 2 2512** — 62.8% SWE-bench at $0.67/MTok blended via **Mistral** · [https://tokenmonopoly.com/best/devstral-2512](https://tokenmonopoly.com/best/devstral-2512) 10. **Gemini 3 Flash Preview** — 75.0% SWE-bench at $0.92/MTok blended via **Google AI Studio** · [https://tokenmonopoly.com/best/gemini-3-flash-preview](https://tokenmonopoly.com/best/gemini-3-flash-preview) ## Curated subscription deals - **Fireworks Fire Pass** — $7/wk · Kimi K2.5 Turbo · Unlimited tokens (RPM throttled) · BYOH. Flat weekly rate for high-volume Kimi K2.5 Turbo access. API key works in any OpenAI-compatible harness. - **Synthetic.new — 1 Pack** — $30/mo · Claude-class frontier · Unlimited tokens, 135 concurrent requests · BYOH. Flat monthly rate giving API access to Claude-class frontier models. Drop the key into any agent framework. - **MiniMax Token Plan — Starter** — $10/mo · MiniMax M2.7 · Starter tier — fraction of Plus window · BYOH. Entry tier of MiniMax's flat-rate API bundle. Uses standard API keys so any harness works. - **MiniMax Token Plan — Plus** — $20/mo · MiniMax M2.7 + speech/image · 4,500 req / 5hr · BYOH. Mid-tier MiniMax subscription with 4,500 requests per 5-hour window. Best price/volume ratio in the flat-rate bucket. - **MiniMax Token Plan — Max** — $50/mo · MiniMax M2.7 + all modalities · 15,000 req / 5hr · BYOH. Top MiniMax tier, 15,000 requests per 5-hour window plus all modalities. For continuous agent workflows. - **OpenCode Zen** — PAYG ~$20 topup · Kimi K2.5, GPT-5-Codex, Gemini, Claude · PAYG — provider cost passthrough · BYOH. Pay-as-you-go gateway to multiple frontier models at zero markup. No monthly fee — just top up and use. - **Claude Pro** — $20/mo · Claude Sonnet 4.6 · ~45 Sonnet msgs / 5hr · BYOH. Standard Claude chat subscription. Claude Code subscription auth is reusable by OpenCode, Cline, and other harnesses. Some third-party restrictions apply on Max tokens. - **Claude Max 5x** — $100/mo · Claude Sonnet 4.6, Opus 4.6 · ~225 Sonnet msgs / 5hr · BYOH. ~225 Sonnet messages per 5-hour window (5× Claude Pro). Claude Code subscription auth reusable by OpenCode, Cline, and other harnesses — though Anthropic restricts some third-party consumers of Max tokens. - **Claude Max 20x** — $200/mo · Claude Sonnet 4.6, Opus 4.6 · ~900 Sonnet msgs / 5hr · BYOH. ~900 Sonnet messages per 5-hour window (20× Claude Pro). Heaviest Claude Code tier for full-day Opus runs. Subscription auth reusable by OpenCode / Cline with some third-party restrictions. - **ChatGPT Plus** — $20/mo · GPT-5, o3, o4-mini, GPT-5 Codex · 600–3,000 Codex local msgs / 5hr · BYOH. Baseline Codex tier. Token-based 5-hour windows (since Apr 2 2026). Codex OAuth works in OpenCode, ForgeCode, Cline and other OpenAI-compatible harnesses. - **ChatGPT Pro ($100)** — $100/mo · GPT-5, o3, o4-mini, GPT-5 Codex · 3,000–15,000 Codex local msgs / 5hr (5× Plus) · BYOH. New $100 Pro tier (Apr 9 2026). 5× Codex of Plus — built for heavy agentic coding sessions. Same model access as $200 Pro. 10× boost through May 31 promo. - **ChatGPT Pro ($200)** — $200/mo · GPT-5, o3, o4-mini, GPT-5 Codex · 12,000–60,000 Codex local msgs / 5hr (20× Plus) · BYOH. Maximum Codex throughput — 20× Plus limits. Unlimited GPT-5 Instant/Thinking, 250 Deep Research runs/mo. Same harness OAuth as other tiers. ## Tracked hosts - [Anthropic](https://www.anthropic.com/pricing): live pricing source - [OpenAI](https://openai.com/api/pricing): live pricing source - [Google AI Studio](https://ai.google.dev/pricing): live pricing source - [Fireworks AI](https://fireworks.ai/pricing): live pricing source - [Groq](https://groq.com/pricing): live pricing source - [DeepSeek](https://api-docs.deepseek.com/quick_start/pricing): live pricing source - [Cerebras](https://cloud.cerebras.ai/pricing): live pricing source - [Z Ai](): live pricing source - [Qwen](): live pricing source - [Arcee Ai](): live pricing source - [Xai](): live pricing source - [Kwaipilot](): live pricing source - [Rekaai](): live pricing source - [Xiaomi](): live pricing source - [Minimax](): live pricing source - [Mistral](): live pricing source - [Nvidia](): live pricing source - [Bytedance Seed](): live pricing source - [Inception](): live pricing source - [Liquid](): live pricing source - [Aion Labs](): live pricing source - [Openrouter](): live pricing source - [Stepfun](): live pricing source - [Moonshot](): live pricing source - [Upstage](): live pricing source - [Writer](): live pricing source - [Allenai](): live pricing source - [Relace](): live pricing source - [Nex Agi](): live pricing source - [Essentialai](): live pricing source - [Amazon](): live pricing source - [Prime Intellect](): live pricing source - [Deepcogito](): live pricing source - [Perplexity](): live pricing source - [Ibm Granite](): live pricing source - [Baidu](): live pricing source - [Thedrummer](): live pricing source - [Meituan](): live pricing source - [Nousresearch](): live pricing source - [Ai21](): live pricing source ## Disclaimer TokenMonopoly is an independent, non-commercial price directory with no affiliate relationships and no advertising. Provider names, model names, and trademarks are the property of their respective owners and are used nominatively only. Prices are best-effort — always verify on the provider's own page before buying. Nothing here is financial or purchasing advice.