Skip to main content

Cheapest LLM APIs in 2026 — Ranked by Price Per Token

Every LLM API ranked by blended price per million tokens. Find the lowest-cost model for your use case across 200+ options.

What is the cheapest LLM API right now?

The cheapest LLM API is Japanese Stable Diffusion XL via Fireworks AI at $0.0001 per million tokens (blended rate). This ranking covers 200+ models across 23 providers, updated every 6 hours.

All Models by Price

200 models
#ModelInputOutput
183Nova 2 Multimodal Embeddings
$0.135Free
193Allenai Olmocr 2 7B 1025
$0.090$0.190
194Qwen: Qwen3 Coder 30B A3B Instruct
30BOpen
$0.070$0.260
67Voxtral Mini 3B 2507
$0.040$0.040
49Paddlepaddle Paddleocr VL
$0.020$0.020
198Baidu: ERNIE 4.5 21B A3B
21B
$0.070$0.280
199Baidu: ERNIE 4.5 21B A3B Thinking
21B
$0.070$0.280
98Baichuan Baichuan M2 32B
$0.070$0.070
164Minimax M1 80K
$0.100$0.100
95DeepSeek R1 0528 Qwen3 8B
$0.060$0.090
82OpenAI: gpt-oss-20b
20B
$0.030$0.140
115OpenAI: gpt-oss-120b
120B
$0.039$0.190
130OpenAI: gpt-oss-120b (exacto)
120B
$0.047$0.228
184OpenAI: gpt-oss-safeguard-20b
20B
$0.070$0.200
13Qwen3 Embedding 0 6B Batch
$0.0050Free
30Qwen3 Reranker 0 6B
$0.010Free
33Qwen3 Embedding 4B Batch
$0.010Free
36Qwen3 Embedding 8B
$0.010Free
38Qwen3 Embedding 0 6B
$0.010Free
44Qwen3 Embedding 4B
$0.020Free
46Qwen3 Reranker 4B
$0.025Free
58Qwen3 4B Fp8
$0.030$0.030
59Qwen3 Embedding 8B Batch
$0.040Free
63Qwen3 Reranker 8B
$0.050Free
88Qwen3 8B Fp8
$0.035$0.138
116Qwen3 235B A22b Instruct 2507
$0.071$0.100
126qwen-turbo
$0.050$0.200
131Qwen: Qwen3 235B A22B Instruct 2507
235BOpen
$0.085$0.120
141Qwen3 1.7b Fp8 Draft
$0.100$0.100
144Qwen3 1.7b
$0.100$0.100
159Qwen3 1.7b Fp8 Draft 40960
$0.100$0.100
161Qwen3 1.7b Fp8 Draft 131072
$0.100$0.100
170Qwen3 0.6b
$0.100$0.100
129Embed V 4.0
$0.120Free
147Cogito V1 Preview Llama 3B
$0.100$0.100
185AllenAI: Olmo 2 32B Instruct
32B
$0.060$0.240
190Gemini Embedding 001
$0.150Free
54Google: Gemma 3n 4B
4B
$0.020$0.040
72Google: Gemma 3 4B
4B
$0.040$0.080
92Google: Gemma 3 12B
12B
$0.040$0.130
155Google: Gemma 3 27B
27B
$0.080$0.160
120Gte Modernbert Base
$0.080$0.080
93Phi 4 Multimodal Instruct
$0.050$0.100
117Command R7b 12 2024
7B
$0.045$0.180
57DeepSeek Ocr
$0.030$0.030
146DeepSeek R1 Distill Qwen 1.5b
$0.100$0.100
89Zai Org Autoglm Phone 9B Multilingual
$0.035$0.138
123Microsoft: Phi 4
$0.070$0.140
90Nova Micro
$0.035$0.140
91Nova Micro V1
$0.035$0.140

* Prices include ~20% OpenRouter platform fee (verified against actual billing).

Frequently Asked Questions

The cheapest LLM API is Japanese Stable Diffusion XL via Fireworks AI at $0.0001 per million tokens. Prices are compared using a blended rate (3:1 input-to-output ratio) across 200+ models.