Skip to main content

Llama API Pricing Comparison 2026

Meta's open-weight Llama models are among the most widely deployed LLMs, available across dozens of inference providers at competitive prices.

117
Models
10
Providers
$0.0063
Cheapest/1M
11
Open Weight

What is the cheapest Llama model?

The cheapest Llama API is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens (blended). There are 117 Llama models across 10 providers.

All Llama Models by Price

113 models
#ModelInputOutput
25Llama 4 Scout 17B 16e Instruct
$0.080$0.300
58Llama 4 Maverick 17B 128e Instruct Fp8
$0.150$0.600
59Llama4 Scout Instruct Basic
$0.150$0.600
60meta.llama4-scout-17b-instruct-v1:0
$0.170$0.660
61Llama 4 Maverick 17B 128E Instruct
$0.200$0.600
66Llama4 Maverick Instruct Basic
$0.220$0.880
70meta.llama4-maverick-17b-instruct-v1:0
$0.240$0.970
12Cogito V1 Preview Llama 3B
$0.100$0.100
31Cogito V1 Preview Llama 8B
$0.200$0.200
81Cogito V1 Preview Llama 70B
$0.900$0.900
54DeepSeek R1 Distill Llama 8B
$0.200$0.200
30Meta: Llama 3.3 70B Instruct
70BOpen
$0.120$0.384
74Llama 3.3 70B
$0.590$0.790
75meta.llama3-3-70b-instruct-v1:0
$0.720$0.720
1Meta: Llama 3.2 1B Instruct
1BOpen
$0.0050$0.010
2Meta: Llama 3.2 3B Instruct
3BOpen
$0.020$0.020
11accounts/fireworks/models/llama-v3p2-3b
$0.100$0.100
16eu.meta.llama3-2-1b-instruct-v1:0
$0.100$0.100
23accounts/fireworks/models/llama-v3p2-1b
$0.100$0.100
26eu.meta.llama3-2-3b-instruct-v1:0
$0.150$0.150
55Meta: Llama 3.2 11B Vision Instruct
11BOpen
$0.200$0.200
63meta.llama3-2-11b-instruct-v1:0
$0.350$0.350
64Llama 3.2 90B Vision Instruct
$0.350$0.400
106meta.llama3-2-90b-instruct-v1:0
$2.00$2.00
4Meta Llama 3.1 8B Instruct Turbo
$0.020$0.030
5Meta Llama 3.1 8B Instruct
8B
$0.020$0.050
8Llama Guard 3 8B
8BOpen
$0.055$0.055
9Llama 3.1 8B Instant
$0.050$0.080
20accounts/fireworks/models/llama-v3p1-70b-instruct-1b
$0.100$0.100
21accounts/fireworks/models/llama-v3p1-405b-instruct-long
$0.100$0.100
22Llama3.1 8B
$0.100$0.100
24Llama Guard 3 1B
$0.100$0.100
28Meta Llama Guard 3 11B Vision Turbo
$0.180$0.180
29Meta: Llama Guard 4 12B
12BOpen
$0.180$0.180
34Meta Llama Guard 2 8B
$0.200$0.200
35Meta Llama 3.1 8B Instruct
$0.200$0.200
38Llama 3.1 Sonar Small 128K Chat
$0.200$0.200
39Llama 3.1 Sonar Small 128K Online
$0.200$0.200
46Llamaguard 7B
$0.200$0.200
53Meta Llama 3.1 8B Instruct
$0.200$0.200
56Meta: LlamaGuard 2 8B
8BOpen
$0.200$0.200
57meta.llama3-1-8b-instruct-v1:0
$0.220$0.220
67Meta Llama 3.1 70B Instruct Turbo
$0.400$0.400
69Llama 3.1 70B Instruct
70B
$0.400$0.400
72Llama3.1 70B
$0.600$0.600
76Meta Llama 3.1 405B Instruct
405B
$0.800$0.800
92Meta Llama 3.1 70B Instruct Reference
$0.900$0.900
94Meta Llama 3.1 70B Instruct
$0.900$0.900
96meta.llama3-1-70b-instruct-v1:0
$0.990$0.990
97Llama 3.1 Sonar Large 128K Online
$1.00$1.00

Frequently Asked Questions

The cheapest Llama model is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens (blended rate). Prices vary across providers — Sector HQ tracks them all.