# TokenMonopoly — AI coding deals leaderboard

> Live pricing and SWE-bench scores for AI APIs used in coding harnesses. Prices refreshed daily.

- Last updated: 2026-04-12T04:00:06.383Z
- Providers tracked: 96
- Models listed: 27
- Total host offers: 706
- Canonical HTML view: https://tokenmonopoly.com/

## Methodology

- **Input / Output $/MTok** — pay-per-token is direct; subscription offers are normalized to an effective rate assuming 5M input + 1M output tokens/day.
- **SWE-bench** — accuracy scores scraped from vals.ai's public SWE-bench leaderboard and fuzzy-matched to each model's canonical name.
- **Price/perf score** — `SWE-bench ÷ effective blended $/MTok`. Higher is better. Models with no SWE-bench score are ranked last.
- **Hosts** — most models are served by multiple hosts (e.g. Anthropic, Bedrock, Vertex; or DeepInfra, Together, Groq for open models). The main table shows the cheapest host per model and the host count; per-host breakdowns follow.

## Ranked models

| # | Model | Cheapest Host | Hosts | SWE-bench | Input $/MTok | Output $/MTok | Save | Type |
|---|-------|---------------|------:|----------:|-------------:|--------------:|-----:|------|
| 1 | MiniMax M2.1 | AtlasCloud | 6 | 74.8% | $0.29 | $0.95 | −26% | closed |
| 2 | GPT-5.4 Nano | Azure | 2 | 69.8% | $0.20 | $1.25 | — | closed |
| 3 | MiniMax M2.7 | Minimax | 1 | 73.8% | $0.30 | $1.20 | — | closed |
| 4 | Gemini 3.1 Flash Lite Preview | Google AI Studio | 2 | 62.8% | $0.25 | $1.50 | — | closed |
| 5 | Qwen3.6 Plus | Alibaba | 1 | 73.4% | $0.33 | $1.95 | — | open |
| 6 | Kimi K2.5 | Chutes | 15 | 70.0% | $0.38 | $1.72 | −42% | open |
| 7 | GLM 4.7 | Chutes | 11 | 69.4% | $0.39 | $1.75 | −74% | closed |
| 8 | GPT-5 Mini | OpenAI | 2 | 60.8% | $0.25 | $2.00 | — | closed |
| 9 | Devstral 2 2512 | Mistral | 1 | 62.8% | $0.40 | $2.00 | — | open |
| 10 | Gemini 3 Flash Preview | Google AI Studio | 2 | 75.0% | $0.50 | $3.00 | — | closed |
| 11 | GLM 5 | GMICloud | 16 | 71.4% | $0.72 | $0.60 | −41% | closed |
| 12 | Kimi K2 0711 | Novita | 1 | 60.2% | $0.57 | $2.30 | — | open |
| 13 | GLM 5.1 | Chutes | 13 | 76.4% | $0.95 | $3.15 | −45% | closed |
| 14 | GPT-5.4 Mini | Azure | 2 | 73.0% | $0.75 | $4.50 | — | closed |
| 15 | Claude Haiku 4.5 | Amazon Bedrock | 3 | 66.6% | $1.00 | $5.00 | — | closed |
| 16 | GPT-5.1 | Azure | 2 | 69.8% | $1.25 | $10.00 | — | closed |
| 17 | GPT-5 | Azure | 2 | 69.0% | $1.25 | $10.00 | — | closed |
| 18 | Gemini 3.1 Pro Preview | Google AI Studio | 2 | 78.8% | $2.00 | $12.00 | — | closed |
| 19 | GPT-5.3-Codex | OpenAI | 2 | 78.0% | $1.75 | $14.00 | — | closed |
| 20 | Gemini 2.5 Pro | Google AI Studio | 2 | 54.4% | $1.25 | $10.00 | — | closed |
| 21 | GPT-5.2 | Azure | 2 | 75.8% | $1.75 | $14.00 | — | closed |
| 22 | GPT-5.2-Codex | Azure | 2 | 72.4% | $1.75 | $14.00 | — | closed |
| 23 | GPT-5.4 | Azure | 2 | 78.2% | $2.50 | $15.00 | — | closed |
| 24 | Claude Sonnet 4.6 | Anthropic | 4 | 77.4% | $3.00 | $15.00 | — | closed |
| 25 | Claude Sonnet 4.5 | Google AI Studio | 3 | 70.0% | $3.00 | $15.00 | — | closed |
| 26 | Claude Opus 4.6 | Amazon Bedrock | 4 | 78.2% | $5.00 | $25.00 | — | closed |
| 27 | Claude Opus 4.5 | Anthropic | 3 | 76.4% | $5.00 | $25.00 | — | closed |

## Host breakdown

### MiniMax M2.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| AtlasCloud | $0.29 | $0.95 | fp8 | 97.5% |
| Novita | $0.30 | $1.20 | fp8 | 100.0% |
| Nebius | $0.30 | $1.20 | fp8 | — |
| Fireworks AI | $0.30 | $1.20 | — | — |
| Minimax | $0.30 | $1.20 | fp8 | — |
| Venice | $0.35 | $1.50 | — | — |

### GPT-5.4 Nano

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $0.20 | $1.25 | — | 100.0% |
| OpenAI | $0.20 | $1.25 | — | — |

### Gemini 3.1 Flash Lite Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $0.25 | $1.50 | — | 99.4% |
| Google AI Studio | $0.25 | $1.50 | — | 99.4% |

### Kimi K2.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Chutes | $0.38 | $1.72 | int4 | 97.9% |
| Io Net | $0.38 | $1.72 | int4 | 97.5% |
| Inceptron | $0.44 | $2.20 | int4 | 100.0% |
| DeepInfra | $0.45 | $2.25 | — | 99.2% |
| SiliconFlow | $0.45 | $2.25 | int4 | 100.0% |
| AtlasCloud | $0.49 | $2.50 | int4 | 99.6% |
| Together | $0.50 | $2.80 | — | 97.6% |
| Parasail | $0.60 | $2.80 | int4 | 99.8% |
| Novita | $0.57 | $2.85 | — | 100.0% |
| Phala | $0.60 | $3.00 | — | 89.3% |
| Moonshot AI | $0.60 | $3.00 | int4 | 99.8% |
| BaseTen | $0.60 | $3.00 | fp4 | — |
| Fireworks AI | $0.60 | $3.00 | — | — |
| ModelRun | $0.55 | $3.25 | fp4 | 100.0% |
| Venice | $0.56 | $3.50 | — | 93.4% |

### GLM 4.7

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Chutes | $0.39 | $1.75 | bf16 | 100.0% |
| DeepInfra | $0.40 | $1.75 | fp4 | 99.3% |
| AtlasCloud | $0.52 | $1.85 | fp8 | 100.0% |
| Nebius | $0.40 | $2.00 | fp8 | 100.0% |
| Novita | $0.54 | $1.98 | fp8 | 98.7% |
| Parasail | $0.45 | $2.10 | fp8 | 100.0% |
| SiliconFlow | $0.45 | $2.20 | fp8 | 100.0% |
| Z Ai | $0.60 | $2.20 | — | 99.5% |
| Google AI Studio | $0.60 | $2.20 | — | 100.0% |
| Venice | $0.55 | $2.65 | fp4 | 100.0% |
| Cerebras | $2.25 | $2.75 | fp16 | 100.0% |

### GPT-5 Mini

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $0.25 | $2.00 | — | 100.0% |
| Azure | $0.25 | $2.00 | — | — |

### Gemini 3 Flash Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $0.50 | $3.00 | — | 99.7% |
| Google AI Studio | $0.50 | $3.00 | — | 99.5% |

### GLM 5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| GMICloud | $1.00 | $0.60 | fp8 | 99.8% |
| Ambient | $0.72 | $2.30 | fp8 | 99.0% |
| DeepInfra | $0.80 | $2.56 | fp4 | 99.8% |
| SiliconFlow | $0.95 | $2.55 | fp8 | 99.9% |
| BaseTen | $0.95 | $3.15 | fp4 | — |
| AtlasCloud | $0.95 | $3.15 | fp8 | 100.0% |
| Chutes | $0.95 | $3.15 | fp8 | — |
| Parasail | $1.00 | $3.20 | fp8 | 100.0% |
| Venice | $1.00 | $3.20 | fp8 | 99.7% |
| Friendli | $1.00 | $3.20 | — | 100.0% |
| Novita | $1.00 | $3.20 | fp8 | 100.0% |
| StreamLake | $1.00 | $3.20 | — | 100.0% |
| Z Ai | $1.00 | $3.20 | — | 99.9% |
| Together | $1.00 | $3.20 | — | 98.1% |
| Fireworks AI | $1.00 | $3.20 | — | — |
| Phala | $1.20 | $3.50 | — | 93.2% |

### GLM 5.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Chutes | $0.95 | $3.15 | fp8 | 77.8% |
| Io Net | $1.40 | $4.40 | fp8 | 99.9% |
| GMICloud | $1.40 | $4.40 | fp8 | 97.5% |
| Novita | $1.40 | $4.40 | fp8 | 100.0% |
| DeepInfra | $1.40 | $4.40 | fp8 | 97.3% |
| Parasail | $1.40 | $4.40 | fp8 | 97.2% |
| Together | $1.40 | $4.40 | — | 99.6% |
| Fireworks AI | $1.40 | $4.40 | — | 99.8% |
| AtlasCloud | $1.40 | $4.40 | fp8 | 100.0% |
| Z Ai | $1.40 | $4.40 | — | 99.4% |
| Friendli | $1.40 | $4.40 | — | 100.0% |
| SiliconFlow | $1.40 | $4.40 | fp8 | 100.0% |
| Venice | $1.75 | $5.50 | fp8 | 96.3% |

### GPT-5.4 Mini

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $0.75 | $4.50 | — | 100.0% |
| OpenAI | $0.75 | $4.50 | — | — |

### Claude Haiku 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $1.00 | $5.00 | — | 99.9% |
| Google AI Studio | $1.00 | $5.00 | — | 100.0% |
| Anthropic | $1.00 | $5.00 | — | 100.0% |

### GPT-5.1

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.25 | $10.00 | — | 100.0% |
| OpenAI | $1.25 | $10.00 | — | 99.9% |

### GPT-5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.25 | $10.00 | — | — |
| OpenAI | $1.25 | $10.00 | — | 100.0% |

### Gemini 3.1 Pro Preview

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $2.00 | $12.00 | — | 99.8% |
| Google AI Studio | $2.00 | $12.00 | — | 99.1% |

### GPT-5.3-Codex

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| OpenAI | $1.75 | $14.00 | — | 99.5% |
| Azure | $1.75 | $14.00 | — | 100.0% |

### Gemini 2.5 Pro

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $1.25 | $10.00 | — | 96.9% |
| Google AI Studio | $1.25 | $10.00 | — | 93.2% |

### GPT-5.2

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.75 | $14.00 | — | 100.0% |
| OpenAI | $1.75 | $14.00 | — | 99.7% |

### GPT-5.2-Codex

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $1.75 | $14.00 | — | 100.0% |
| OpenAI | $1.75 | $14.00 | — | 99.1% |

### GPT-5.4

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Azure | $2.50 | $15.00 | — | 100.0% |
| OpenAI | $2.50 | $15.00 | — | — |

### Claude Sonnet 4.6

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Anthropic | $3.00 | $15.00 | — | 100.0% |
| Google AI Studio | $3.00 | $15.00 | — | 100.0% |
| Azure | $3.00 | $15.00 | — | — |
| Amazon Bedrock | $3.00 | $15.00 | — | 99.8% |

### Claude Sonnet 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Google AI Studio | $3.00 | $15.00 | — | 100.0% |
| Anthropic | $3.00 | $15.00 | — | 99.7% |
| Amazon Bedrock | $3.00 | $15.00 | — | 99.9% |

### Claude Opus 4.6

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Amazon Bedrock | $5.00 | $25.00 | — | 99.9% |
| Azure | $5.00 | $25.00 | — | — |
| Google AI Studio | $5.00 | $25.00 | — | 100.0% |
| Anthropic | $5.00 | $25.00 | — | 100.0% |

### Claude Opus 4.5

| Host | Input $/MTok | Output $/MTok | Quant | Uptime 30m |
|------|-------------:|--------------:|-------|-----------:|
| Anthropic | $5.00 | $25.00 | — | 100.0% |
| Amazon Bedrock | $5.00 | $25.00 | — | 99.9% |
| Google AI Studio | $5.00 | $25.00 | — | 99.8% |


## Subscription deals

_None currently tracked._
