Skip to main content

Cheapest Open-Weight / Open Source LLM APIs in 2026

Open-weight LLMs you can self-host or access through inference providers. Compare pricing for Llama, Mistral, Qwen, and other open models.

What is the cheapest open-weight / open source LLM?

The cheapest open-weight / open source LLM API is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens. 94 open weight models are compared across all providers.

Open-Weight / Open Source Models by Price

94 models
#ModelInputOutput
38DeepSeek: DeepSeek V3.2 Exp
Open
$0.210$0.320
79Mistral: Devstral 2 2512
Open
$0.480$2.40
62Qwen: Qwen-Plus
Open
$0.312$0.936
63Qwen: Qwen Plus 0728 (thinking)
Open
$0.312$0.936
14Qwen: Qwen3 Coder 30B A3B Instruct
30BOpen
$0.070$0.260
55Qwen: Qwen3 Coder Next
Open
$0.180$0.960
61Qwen: Qwen3 Coder Flash
Open
$0.234$1.17
65Qwen: Qwen3 Coder 480B A35B
480BOpen
$0.264$1.20
71Qwen: Qwen3 Coder 480B A35B (exacto)
480BOpen
$0.264$2.16
85Qwen: Qwen3 Coder Plus
Open
$0.780$3.90
28Mistral: Voxtral Small 24B 2507
24BOpen
$0.120$0.360
80Mistral: Devstral Medium
Open
$0.480$2.40
25Mistral: Devstral Small 1.1
Open
$0.120$0.360
51DeepSeek: DeepSeek V3.1 Terminus
Open
$0.210$0.790
59DeepSeek: DeepSeek V3.1 Terminus (exacto)
Open
$0.252$0.948
77DeepSeek: R1 0528
Open
$0.500$2.15
13Qwen: Qwen3 235B A22B Instruct 2507
235BOpen
$0.085$0.120
16Qwen: Qwen3 14B
14BOpen
$0.072$0.288
18Qwen: Qwen3 30B A3B
30BOpen
$0.080$0.280
19Qwen: Qwen3 32B
32BOpen
$0.080$0.280
21Qwen: Qwen3 8B
8BOpen
$0.060$0.480
22Qwen: Qwen3 30B A3B Instruct 2507
30BOpen
$0.108$0.360
29Qwen: Qwen3 VL 8B Instruct
8BOpen
$0.080$0.500
31Qwen: Qwen3 30B A3B Thinking 2507
30BOpen
$0.096$0.480
35Qwen: Qwen3 VL 32B Instruct
32BOpen
$0.125$0.499
43Qwen: Qwen3 VL 30B A3B Instruct
30BOpen
$0.150$0.600
45Qwen: Qwen3 235B A22B
235BOpen
$0.180$0.540
49Qwen: Qwen3 Next 80B A3B Thinking
80BOpen
$0.117$0.936
50Qwen: Qwen3 Next 80B A3B Instruct
80BOpen
$0.090$1.10
54Qwen: Qwen3 VL 235B A22B Instruct
235BOpen
$0.200$0.880
56Qwen: Qwen3 235B A22B Thinking 2507
235BOpen
$0.220$0.880
57Qwen: Qwen3 VL 235B A22B Thinking
235BOpen
$0.220$0.880
58Qwen: Qwen3 VL 30B A3B Thinking
30BOpen
$0.200$1.00
66Qwen: Qwen3 VL 8B Thinking
8BOpen
$0.140$1.64
69Qwen: Qwen3.5 Plus 2026-02-15
Open
$0.312$1.87
82Qwen: Qwen3.5 397B A17B
397BOpen
$0.468$2.81
87Qwen: Qwen3 Max
Open
$0.936$4.68
88Qwen: Qwen3 Max Thinking
Open
$0.936$4.68
34Qwen: QwQ 32B
32BOpen
$0.150$0.400
53Mistral: Saba
Open
$0.240$0.720
44DeepSeek: R1 Distill Qwen 32B
32BOpen
$0.270$0.270
70DeepSeek: R1 Distill Llama 70B
70BOpen
$0.700$0.800
23NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
49BOpen
$0.100$0.400
30Meta: Llama 3.3 70B Instruct
70BOpen
$0.120$0.384
74Qwen2.5 Coder 32B Instruct
32BOpen
$0.792$1.20
15Mistral: Ministral 3 3B 2512
3BOpen
$0.120$0.120
27Mistral: Ministral 3 8B 2512
8BOpen
$0.180$0.180
40Mistral: Ministral 3 14B 2512
14BOpen
$0.240$0.240
76NVIDIA: Llama 3.1 Nemotron 70B Instruct
70BOpen
$0.900$0.900
1Meta: Llama 3.2 1B Instruct
1BOpen
$0.0050$0.010

* Prices include ~20% OpenRouter platform fee (verified against actual billing).

Frequently Asked Questions

The cheapest open-weight / open source LLM API is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens (blended rate). There are 94 open weight models available.