Skip to main content

Cheapest Chat LLM APIs in 2026

Chat and conversational LLM APIs ranked by price. Find the cheapest option for chatbots, assistants, and text generation.

What is the cheapest chat LLM?

The cheapest chat LLM API is Japanese Stable Diffusion XL via Fireworks AI at $0.0001 per million tokens. 200 chat models are compared across all providers.

Chat Models by Price

200 models
#ModelInputOutput
164Allenai Olmocr 2 7B 1025
$0.090$0.190
200Zai Org GLM 4.7 Flash
$0.060$0.400
196GPT 5 Nano
$0.050$0.400
165Qwen: Qwen3 Coder 30B A3B Instruct
30BOpen
$0.070$0.260
61Voxtral Mini 3B 2507
$0.040$0.040
44Paddlepaddle Paddleocr VL
$0.020$0.020
168Baidu: ERNIE 4.5 21B A3B
21B
$0.070$0.280
169Baidu: ERNIE 4.5 21B A3B Thinking
21B
$0.070$0.280
90Baichuan Baichuan M2 32B
$0.070$0.070
140Minimax M1 80K
$0.100$0.100
87DeepSeek R1 0528 Qwen3 8B
$0.060$0.090
75OpenAI: gpt-oss-20b
20B
$0.030$0.140
95OpenAI: gpt-oss-120b
120B
$0.039$0.190
108OpenAI: gpt-oss-120b (exacto)
120B
$0.047$0.228
158OpenAI: gpt-oss-safeguard-20b
20B
$0.070$0.200
184GPT Oss 20B 1 0
$0.070$0.300
12Qwen3 Embedding 0 6B Batch
$0.0050Free
28Qwen3 Reranker 0 6B
$0.010Free
31Qwen3 Embedding 4B Batch
$0.010Free
33Qwen3 Embedding 8B
$0.010Free
35Qwen3 Embedding 0 6B
$0.010Free
40Qwen3 Embedding 4B
$0.020Free
41Qwen3 Reranker 4B
$0.025Free
52Qwen3 4B Fp8
$0.030$0.030
53Qwen3 Embedding 8B Batch
$0.040Free
57Qwen3 Reranker 8B
$0.050Free
81Qwen3 8B Fp8
$0.035$0.138
96Qwen3 235B A22b Instruct 2507
$0.071$0.100
105qwen-turbo
$0.050$0.200
109Qwen: Qwen3 235B A22B Instruct 2507
235BOpen
$0.085$0.120
118Qwen3 1.7b Fp8 Draft
$0.100$0.100
121Qwen3 1.7b
$0.100$0.100
135Qwen3 1.7b Fp8 Draft 40960
$0.100$0.100
137Qwen3 1.7b Fp8 Draft 131072
$0.100$0.100
146Qwen3 0.6b
$0.100$0.100
183Qwen: Qwen3 14B
14BOpen
$0.072$0.288
185Qwen: Qwen3 30B A3B
30BOpen
$0.080$0.280
187Qwen: Qwen3 32B
32BOpen
$0.080$0.280
195Llama 4 Scout 17B 16e Instruct
$0.080$0.300
124Cogito V1 Preview Llama 3B
$0.100$0.100
159AllenAI: Olmo 2 32B Instruct
32B
$0.060$0.240
48Google: Gemma 3n 4B
4B
$0.020$0.040
85Phi 4 Multimodal Instruct
$0.050$0.100
192Phi-4-mini-instruct
$0.075$0.300
199Phi-4-mini-reasoning
$0.080$0.320
188Gemini 2.0 Flash Lite Preview 02 05
$0.075$0.300
189Gemini 2.0 Flash Lite 001
$0.075$0.300
191Gemini 2.0 Flash Lite
$0.075$0.300
97Command R7b 12 2024
7B
$0.045$0.180
51DeepSeek Ocr
$0.030$0.030

* Prices include ~20% OpenRouter platform fee (verified against actual billing).

Frequently Asked Questions

The cheapest chat LLM API is Japanese Stable Diffusion XL via Fireworks AI at $0.0001 per million tokens (blended rate). There are 200 chat models available.