
Cheapest Open-Weight / Open Source LLM APIs in 2026

Open-weight LLMs you can self-host or access through inference providers. Compare pricing for Llama, Mistral, Qwen, and other open models.

What is the cheapest open-weight / open source LLM?

The cheapest open-weight / open source LLM API is Meta: Llama 3.2 1B Instruct via DeepInfra at $0.0063 per million tokens (blended rate). 94 open-weight models are compared across all providers.

Open-Weight / Open Source Models by Price

94 models
| # | Model | Size | Weights | Input ($/M tokens) | Output ($/M tokens) |
|---|-------|------|---------|--------------------|---------------------|
| 38 | DeepSeek: DeepSeek V3.2 Exp | | Open | $0.210 | $0.320 |
| 16 | Qwen: Qwen3 Coder 30B A3B Instruct | 30B | Open | $0.070 | $0.260 |
| 49 | Qwen: Qwen3 Coder Next | | Open | $0.144 | $0.900 |
| 30 | Mistral: Voxtral Small 24B 2507 | 24B | Open | $0.120 | $0.360 |
| 27 | Mistral: Devstral Small 1.1 | | Open | $0.120 | $0.360 |
| 14 | Qwen: Qwen3 235B A22B Instruct 2507 | 235B | Open | $0.085 | $0.120 |
| 18 | Qwen: Qwen3 14B | 14B | Open | $0.072 | $0.288 |
| 19 | Qwen: Qwen3 30B A3B | 30B | Open | $0.080 | $0.280 |
| 20 | Qwen: Qwen3 32B | 32B | Open | $0.080 | $0.280 |
| 22 | Qwen: Qwen3 30B A3B Thinking 2507 | 30B | Open | $0.061 | $0.408 |
| 23 | Qwen: Qwen3 8B | 8B | Open | $0.060 | $0.480 |
| 24 | Qwen: Qwen3 30B A3B Instruct 2507 | 30B | Open | $0.108 | $0.360 |
| 31 | Qwen: Qwen3 VL 8B Instruct | 8B | Open | $0.080 | $0.500 |
| 35 | Qwen: Qwen3 VL 32B Instruct | 32B | Open | $0.125 | $0.499 |
| 43 | Qwen: Qwen3 VL 30B A3B Instruct | 30B | Open | $0.150 | $0.600 |
| 45 | Qwen: Qwen3 235B A22B | 235B | Open | $0.180 | $0.540 |
| 50 | Qwen: Qwen3 Next 80B A3B Instruct | 80B | Open | $0.090 | $1.10 |
| 34 | Qwen: QwQ 32B | 32B | Open | $0.150 | $0.400 |
| 44 | DeepSeek: R1 Distill Qwen 32B | 32B | Open | $0.270 | $0.270 |
| 25 | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | 49B | Open | $0.100 | $0.400 |
| 32 | Meta: Llama 3.3 70B Instruct | 70B | Open | $0.120 | $0.384 |
| 42 | Qwen2.5 Coder 32B Instruct | 32B | Open | $0.240 | $0.240 |
| 17 | Mistral: Ministral 3 3B 2512 | 3B | Open | $0.120 | $0.120 |
| 29 | Mistral: Ministral 3 8B 2512 | 8B | Open | $0.180 | $0.180 |
| 40 | Mistral: Ministral 3 14B 2512 | 14B | Open | $0.240 | $0.240 |
| 1 | Meta: Llama 3.2 1B Instruct | 1B | Open | $0.0050 | $0.010 |
| 2 | Meta: Llama 3.2 3B Instruct | 3B | Open | $0.020 | $0.020 |
| 7 | Meta: Llama 3.2 11B Vision Instruct | 11B | Open | $0.049 | $0.049 |
| 12 | Qwen: Qwen2.5 7B Instruct | 7B | Open | $0.048 | $0.120 |
| 36 | Qwen2.5 72B Instruct | 72B | Open | $0.144 | $0.468 |
| 39 | Qwen: Qwen2.5-VL 7B Instruct | 7B | Open | $0.240 | $0.240 |
| 11 | Mistral: Mistral Small 3 | 24B | Open | $0.050 | $0.080 |
| 15 | Mistral: Mistral Small 3.2 24B | 24B | Open | $0.072 | $0.216 |
| 28 | Mistral: Mistral Small Creative | | Open | $0.120 | $0.360 |
| 6 | Llama Guard 3 8B | 8B | Open | $0.024 | $0.072 |
| 26 | Meta: Llama Guard 4 12B | 12B | Open | $0.180 | $0.180 |
| 33 | Meta: LlamaGuard 2 8B | 8B | Open | $0.200 | $0.200 |
| 3 | Mistral: Mistral Nemo | | Open | $0.024 | $0.048 |
| 13 | NVIDIA: Nemotron 3 Nano 30B A3B | 30B | Open | $0.050 | $0.200 |
| 47 | NVIDIA: Nemotron Nano 12B 2 VL | 12B | Open | $0.200 | $0.600 |
| 8 | Qwen: Qwen2.5 Coder 7B Instruct | 7B | Open | $0.036 | $0.108 |
| 46 | Qwen: Qwen2.5 VL 32B Instruct | 32B | Open | $0.200 | $0.600 |
| 4 | Meta: Llama 3 8B Instruct | 8B | Open | $0.030 | $0.040 |
| 21 | NousResearch: Hermes 2 Pro - Llama-3 8B | 8B | Open | $0.140 | $0.140 |
| 37 | Nous: Hermes 4 70B | 70B | Open | $0.156 | $0.480 |
| 48 | Nous: Hermes 3 70B Instruct | 70B | Open | $0.300 | $0.300 |
| 5 | Mistral: Mistral 7B Instruct v0.3 | 7B | Open | $0.028 | $0.054 |
| 9 | Mistral: Mistral 7B Instruct v0.2 | 7B | Open | $0.055 | $0.055 |
| 10 | Mistral: Mistral 7B Instruct v0.1 | 7B | Open | $0.055 | $0.055 |
| 41 | Mistral: Mistral 7B Instruct | 7B | Open | $0.240 | $0.240 |

* Prices include a ~20% OpenRouter platform fee (verified against actual billing).
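
To compare a listed price with a provider's own rate, the fee in the footnote can be backed out. The following is a minimal sketch, assuming the ~20% fee is a flat multiplicative markup on the provider price (the page does not say how the fee is applied); the function name `price_before_fee` is hypothetical.

```python
# Assumption: the "~20%" platform fee is a flat multiplicative markup,
# i.e. listed_price = provider_price * (1 + fee).
PLATFORM_FEE = 0.20


def price_before_fee(listed_price: float, fee: float = PLATFORM_FEE) -> float:
    """Estimate the provider price with the platform fee removed."""
    return listed_price / (1.0 + fee)


# Qwen: Qwen3 14B input price from the table, in $ per million tokens.
listed_input = 0.072
print(f"{price_before_fee(listed_input):.3f}")  # prints 0.060
```

Because the fee is only approximate, treat the result as an estimate rather than the provider's exact list price.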

Frequently Asked Questions

The cheapest open-weight / open source LLM API is Meta: Llama 3.2 1B Instruct via DeepInfra at $0.0063 per million tokens (blended rate). There are 94 open-weight models available.
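
The page does not define how the blended rate is computed. A 3:1 weighting of input to output tokens is an assumption, but it reproduces the quoted figure: (3 × $0.0050 + 1 × $0.010) / 4 = $0.00625, which rounds to $0.0063. The sketch below computes that rate and sorts a few table entries by it; the function name `blended_rate` and its default weights are hypothetical.

```python
def blended_rate(input_price: float, output_price: float,
                 input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices ($ per million tokens).

    The 3:1 default is an assumption, not a definition taken from the page.
    """
    total = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total


# (model, input price, output price) copied from the table.
models = [
    ("Mistral: Mistral Nemo", 0.024, 0.048),
    ("Meta: Llama 3.2 1B Instruct", 0.0050, 0.010),
    ("Meta: Llama 3.2 3B Instruct", 0.020, 0.020),
]

# Cheapest first, by blended rate.
ranked = sorted(models, key=lambda m: blended_rate(m[1], m[2]))
for name, inp, out in ranked:
    print(f"{name}: ${blended_rate(inp, out):.5f}")
```

Under this assumption the three models come out in the same order as their # ranks in the table (1, 2, 3). A workload with a different input-to-output ratio can pass its own weights, which may change the ordering.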