Most reliable GLM 4.7 API providers
Fifteen providers host GLM 4.7. Over the last 30 minutes, Chutes ranks first at 100.00% uptime, level with Parasail, AtlasCloud, Google AI Studio, and Cerebras.
What would GLM 4.7 cost you?
StreamLake is 74% cheaper than Cerebras at this workload.
Input tokens / month: 150M
Output tokens / month: 30M

| Host | Projected cost |
|---|---|
| StreamLake | $109 /mo |
| DekaLLM | $109 /mo |
| Chutes | $111 /mo |
| DeepInfra | $113 /mo |
| Nebius | $120 /mo |
| Parasail | $131 /mo |
| AtlasCloud | $134 /mo |
| SiliconFlow | $134 /mo |
| Novita | $140 /mo |
| Google AI Studio | $156 /mo |
| Z Ai | $156 /mo |
| Fireworks AI | $156 /mo |
| Venice | $162 /mo |
| Phala | $227 /mo |
| Cerebras | $420 /mo |

Projected monthly cost = (input price per MTok × 150) + (output price per MTok × 30), for a workload of 150M input tokens and 30M output tokens per month. Results are rounded to the nearest dollar.
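
The cost formula above can be sketched in a few lines of Python. This is a minimal illustration, not the page's own calculator: the function name `projected_monthly_cost` and its parameter names are hypothetical, and the prices are copied from the provider table.

```python
def projected_monthly_cost(input_price_per_mtok, output_price_per_mtok,
                           input_mtok=150.0, output_mtok=30.0):
    """Return the projected monthly cost in dollars.

    Prices are in $ per million tokens (MTok); volumes are in millions
    of tokens per month. Defaults match the 150M / 30M workload above.
    """
    return input_price_per_mtok * input_mtok + output_price_per_mtok * output_mtok


# Prices taken from the provider table (input $/MTok, output $/MTok).
streamlake = projected_monthly_cost(0.42, 1.54)
cerebras = projected_monthly_cost(2.25, 2.75)

# Relative saving of StreamLake versus Cerebras at this workload.
saving = 1 - streamlake / cerebras

print(f"StreamLake: ${streamlake:.0f}/mo")
print(f"Cerebras:   ${cerebras:.0f}/mo")
print(f"Saving:     {saving:.0%}")
```

To model a different workload, pass other values for `input_mtok` and `output_mtok`. Because output tokens cost several times more than input tokens at every host, an output-heavy workload can change the ranking.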
| # | Host | Context (tokens) | Input $/MTok | Output $/MTok | Blended $/MTok | Uptime (30 min) | Quantization |
|---|---|---|---|---|---|---|---|
| 1 | Chutes | 203k | $0.39 | $1.75 | $0.62 | 100.00% | bf16 |
| 2 | Parasail | 203k | $0.45 | $2.10 | $0.72 | 100.00% | fp8 |
| 3 | AtlasCloud | 203k | $0.52 | $1.85 | $0.74 | 100.00% | fp8 |
| 4 | Google AI Studio | 200k | $0.60 | $2.20 | $0.87 | 100.00% | — |
| 5 | Cerebras | 131k | $2.25 | $2.75 | $2.33 | 100.00% | fp16 |
| 6 | StreamLake | 200k | $0.42 | $1.54 | $0.61 | 99.87% | — |
| 7 | DeepInfra | 203k | $0.40 | $1.75 | $0.63 | 99.79% | fp4 |
| 8 | SiliconFlow | 205k | $0.45 | $2.20 | $0.74 | 97.69% | fp8 |
| 9 | Z Ai | 200k | $0.60 | $2.20 | $0.87 | 96.22% | — |
| 10 | Novita | 205k | $0.54 | $1.98 | $0.78 | 95.49% | fp8 |
| 11 | Nebius | 203k | $0.40 | $2.00 | $0.67 | 93.90% | fp8 |
| 12 | Fireworks AI | — | $0.60 | $2.20 | $0.87 | — | — |
| 13 | DekaLLM | 203k | $0.38 | $1.74 | $0.61 | — | fp4 |
| 14 | Venice | 198k | $0.55 | $2.65 | $0.90 | — | fp4 |
| 15 | Phala | 131k | $0.85 | $3.30 | $1.26 | — | — |
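
The page does not define the Blended column. Its values are consistent with a token-weighted average of the input and output prices at the 150M input / 30M output workload (a 5:1 ratio), so the sketch below assumes that definition. The function name `blended_price` is hypothetical.

```python
def blended_price(input_price, output_price, input_mtok=150.0, output_mtok=30.0):
    """Token-weighted average price in $/MTok.

    Assumption: 'Blended' means total cost divided by total tokens for
    the given input/output mix. The source page does not state this.
    """
    total_cost = input_price * input_mtok + output_price * output_mtok
    return total_cost / (input_mtok + output_mtok)


# (input $/MTok, output $/MTok) for a few rows of the table.
hosts = {
    "Chutes": (0.39, 1.75),
    "StreamLake": (0.42, 1.54),
    "Cerebras": (2.25, 2.75),
}

blended = {name: blended_price(*prices) for name, prices in hosts.items()}

# Print cheapest first.
for name, price in sorted(blended.items(), key=lambda item: item[1]):
    print(f"{name:<12} ${price:.2f}/MTok")
```

Under this assumption the blended price depends on the input/output mix, so the column only ranks hosts fairly for workloads close to the 5:1 ratio.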