Meta

The Llama open-weights family: among the most widely deployed open foundation models. Run them on your own hardware or rent them from an inference provider.

Founded 2013 · HQ Menlo Park, USA
MODELS TRACKED: 29 (3 categories)
FLAGSHIP: Llama 4 Maverick (frontier)
MIN INPUT: $0.020/M (cheapest model in family)
AVG BLENDED: $0.306/M (across 29 priced models)
MAX CONTEXT: 512K (largest window in family)
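The page does not say how a "blended" price is derived. A common convention is a weighted average of the input and output prices, so the sketch below assumes a 3:1 input-to-output token weighting. Both the weighting and the function name `blended_price` are illustrative assumptions, not this page's stated method.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices, in USD per million tokens.

    The default 3:1 weighting is an assumption for illustration only.
    """
    total = input_weight + output_weight
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_per_m * input_weight + output_per_m * output_weight) / total


# Llama 4 Maverick as listed on this page: in $0.200/M, out $0.600/M.
maverick = blended_price(0.200, 0.600)
print(f"${maverick:.3f}/M")
```

Changing the weights to match your own traffic mix (for example, 1:1 for chat-heavy workloads) gives a figure that is more useful for comparison than any single published average.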
Frontier: 1 model
Multimodal: 2 models
Efficient: 26 models
Llama 4 Maverick
frontier · 128K ctx
in $0.200/M · out $0.600/M
MoE architecture

Llama 4 Scout
efficient · 512K ctx
in $0.100/M · out $0.350/M
Long-context MoE

Llama 3.3 70B
efficient · 128K ctx
in $0.230/M · out $0.400/M
Strong open weights

Llama 3.1 70B
efficient · 128K ctx
in $0.350/M · out $0.400/M
Balanced open

Llama 3.1 8B
efficient · 128K ctx
in $0.100/M · out $0.100/M
Compact open

Llama 3.2 90B Vision
multimodal · 128K ctx
in $0.880/M · out $0.880/M
Vision + language

Llama 3.2 11B Vision
multimodal · 128K ctx
in $0.160/M · out $0.160/M
Compact vision

Llama 3.2 3B
efficient · 128K ctx
in $0.060/M · out $0.060/M
Ultra-compact

Llama 3.2 1B
efficient · 128K ctx
in $0.040/M · out $0.040/M
Smallest Llama

Llama 3 70B
efficient · 8K ctx
in $0.590/M · out $0.790/M
Llama 3 base

Llama 3 8B
efficient · 8K ctx
in $0.200/M · out $0.200/M
Llama 3 compact

Llama 2 70B Chat
efficient · 4K ctx
in $0.900/M · out $0.900/M
Classic Llama 2

Llama 2 13B Chat
efficient · 4K ctx
in $0.220/M · out $0.220/M
Mid Llama 2

Llama 2 7B Chat
efficient · 4K ctx
in $0.180/M · out $0.180/M
Compact Llama 2

CodeLlama 70B Instruct
efficient · 16K ctx
in $0.900/M · out $0.900/M
Largest CodeLlama

CodeLlama 34B Instruct
efficient · 16K ctx
in $0.350/M · out $0.350/M
Strong code model

CodeLlama 13B Instruct
efficient · 16K ctx
in $0.220/M · out $0.220/M
Mid code model

CodeLlama 7B Instruct
efficient · 16K ctx
in $0.180/M · out $0.180/M
Compact code

Llama Guard 3 8B
text->text · 131K ctx
in $0.480/M · out $0.030/M

Llama 3.3 70B Instruct (free)
text->text · 66K ctx
in $0.000/M · out $0.000/M

Llama 3.2 3B Instruct (free)
text->text · 131K ctx
in $0.000/M · out $0.000/M

Llama 3.2 1B Instruct
text->text · 60K ctx
in $0.027/M · out $0.200/M

Llama 3.1 8B Instruct
text->text · 16K ctx
in $0.020/M · out $0.050/M

Llama 3.1 70B Instruct
text->text · 131K ctx
in $0.400/M · out $0.400/M

Llama 3 8B Instruct
text->text · 8K ctx
in $0.040/M · out $0.040/M

Llama 3 70B Instruct
text->text · 8K ctx
in $0.510/M · out $0.740/M
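
Prices above are quoted per million tokens, with input and output billed separately. As a rough sketch of how those figures translate into per-request cost, the snippet below uses three of the listed models. The `Pricing` class, the `PRICES` table, and the 10,000-input / 2,000-output token request are illustrative assumptions. Actual billing may differ by provider.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD (hypothetical helper type)."""
    input_per_m: float
    output_per_m: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Return the USD cost of one request with the given token counts."""
        return (input_tokens * self.input_per_m
                + output_tokens * self.output_per_m) / TOKENS_PER_MILLION


# Prices copied from the listing above.
PRICES = {
    "Llama 4 Maverick": Pricing(0.200, 0.600),
    "Llama 4 Scout": Pricing(0.100, 0.350),
    "Llama 3.1 8B": Pricing(0.100, 0.100),
}

# Example request size (an assumption): 10,000 input and 2,000 output tokens.
for name, pricing in PRICES.items():
    print(f"{name}: ${pricing.cost(10_000, 2_000):.6f}")
```

Because output tokens are often priced higher than input tokens, generation-heavy workloads shift the comparison toward models with low output prices.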