Skip to main content

Llama API Pricing Comparison 2026

Meta's open-weight Llama models are among the most widely deployed LLMs, available across dozens of inference providers at competitive prices.

116
Models
11
Providers
$0.0063
Cheapest/1M
11
Open Weight

What is the cheapest Llama model?

The cheapest Llama API is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens (blended). There are 116 Llama models across 11 providers.

All Llama Models by Price

113 models
#ModelInputOutput
26Llama 4 Scout 17B 16e Instruct
$0.080$0.300
16Cogito V1 Preview Llama 3B
$0.100$0.100
32Cogito V1 Preview Llama 8B
$0.200$0.200
31Meta: Llama 3.3 70B Instruct
70BOpen
$0.120$0.384
1Meta: Llama 3.2 1B Instruct
1BOpen
$0.0050$0.010
2Meta: Llama 3.2 3B Instruct
3BOpen
$0.020$0.020
9Meta: Llama 3.2 11B Vision Instruct
11BOpen
$0.049$0.049
12accounts/fireworks/models/llama-v3p2-3b
$0.100$0.100
17accounts/fireworks/models/llama-v3p2-1b
$0.100$0.100
19eu.meta.llama3-2-1b-instruct-v1:0
$0.100$0.100
27eu.meta.llama3-2-3b-instruct-v1:0
$0.150$0.150
4Meta Llama 3.1 8B Instruct Turbo
$0.020$0.030
5Meta Llama 3.1 8B Instruct
8B
$0.020$0.050
8Llama Guard 3 8B
8BOpen
$0.024$0.072
10Llama 3.1 8B Instant
$0.050$0.080
13accounts/fireworks/models/llama-v3p1-405b-instruct-long
$0.100$0.100
14accounts/fireworks/models/llama-v3p1-70b-instruct-1b
$0.100$0.100
18Llama Guard 3 1B
$0.100$0.100
25Llama3.1 8B
$0.100$0.100
29Meta Llama Guard 3 11B Vision Turbo
$0.180$0.180
30Meta: Llama Guard 4 12B
12BOpen
$0.180$0.180
35Meta Llama Guard 2 8B
$0.200$0.200
36Meta Llama 3.1 8B Instruct
$0.200$0.200
39Llama 3.1 Sonar Small 128K Chat
$0.200$0.200
40Llama 3.1 Sonar Small 128K Online
$0.200$0.200
47Llamaguard 7B
$0.200$0.200
6Meta: Llama 3 8B Instruct
8BOpen
$0.030$0.040
7Meta Llama 3 8B Instruct
$0.030$0.040
15Meta Llama 3 8B Instruct Lite
$0.100$0.100
22Llama 3 8B
$0.050$0.250
24Meta Llama 3 8B Instruct Lite
$0.100$0.100
45Llama V3 8B
$0.200$0.200
46Llama V3 8B Instruct HF
$0.200$0.200
33Code Llama 7B
$0.200$0.200
34Code Llama 13B
$0.200$0.200
49Code Llama 13B Instruct
$0.200$0.200
50Code Llama 13B Python
$0.200$0.200
20Llama V2 70B
$0.100$0.100
21Llama 2 7B Chat
$0.050$0.250
23Llama 2 7B
$0.050$0.250
37Llama 2 13B
$0.100$0.500
38Llama 2 13B Chat
$0.100$0.500
41Llama V2 13B
$0.200$0.200
42Llama V2 13B Chat
$0.200$0.200
43Llama V2 7B
$0.200$0.200
44Llama V2 7B Chat
$0.200$0.200
48Nous Hermes Llama2 13B
$0.200$0.200
3meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.020$0.030
11Meta Llama 3.2 3B Instruct Turbo
$0.060$0.060
28meta-llama/Llama-3.3-70B-Instruct-Turbo
$0.100$0.320

Frequently Asked Questions

The cheapest Llama model is Meta: Llama 3.2 1B Instruct via Deepinfra at $0.0063 per million tokens (blended rate). Prices vary across providers — Sector HQ tracks them all.