| PROVIDER↕ | MODEL↕ | INPUT↕ | OUTPUT↕ | CONTEXT↕ | BLEND · 70/30↑ |
|---|---|---|---|---|---|
| DeepSeek V4 Flash (free) deepseek/deepseek-v4-flash:free | $0/M | $0/M | 1.0M | ||
| Lyria 3 Pro Preview google/lyria-3-pro-preview | $0/M | $0/M | 1.0M | ||
| Lyria 3 Clip Preview google/lyria-3-clip-preview | $0/M | $0/M | 1.0M | ||
| Qwen3 Coder 480B A35B (free) qwen/qwen3-coder:free | $0/M | $0/M | 1.0M | ||
| Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free | $0/M | $0/M | 1M | ||
| Laguna XS.2 (free) poolside/laguna-xs.2:free | $0/M | $0/M | 262K | ||
| Laguna M.1 (free) poolside/laguna-m.1:free | $0/M | $0/M | 262K | ||
| MoonshotAI: Kimi K2.6 (free) moonshotai/kimi-k2.6:free | $0/M | $0/M | 262K | ||
| Gemma 4 26B A4B (free) google/gemma-4-26b-a4b-it:free | $0/M | $0/M | 262K | ||
| Gemma 4 31B (free) google/gemma-4-31b-it:free | $0/M | $0/M | 262K | ||
| MiniMax M2.5 (free) minimax/minimax-m2.5:free | $0/M | $0/M | 262K | ||
| Qwen3 Next 80B A3B Instruct (free) qwen/qwen3-next-80b-a3b-instruct:free | $0/M | $0/M | 262K | ||
| Nemotron 3 Nano Omni (free) nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free | $0/M | $0/M | 256K | ||
| Nemotron 3 Nano 30B A3B (free) nvidia/nemotron-3-nano-30b-a3b:free | $0/M | $0/M | 256K | ||
| gpt-oss-120b (free) openai/gpt-oss-120b:free | $0/M | $0/M | 131K | ||
| gpt-oss-20b (free) openai/gpt-oss-20b:free | $0/M | $0/M | 131K | ||
| Z.ai: GLM 4.5 Air (free) z-ai/glm-4.5-air:free | $0/M | $0/M | 131K | ||
| Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free | $0/M | $0/M | 131K | ||
| Llama 3.2 3B Instruct (free) meta-llama/llama-3.2-3b-instruct:free | $0/M | $0/M | 131K | ||
| Nous: Hermes 3 405B Instruct (free) nousresearch/hermes-3-llama-3.1-405b:free | $0/M | $0/M | 131K | ||
| Nemotron Nano 12B 2 VL (free) nvidia/nemotron-nano-12b-v2-vl:free | $0/M | $0/M | 128K | ||
| Nemotron Nano 9B V2 (free) nvidia/nemotron-nano-9b-v2:free | $0/M | $0/M | 128K | ||
| LiquidAI: LFM2.5-1.2B-Thinking (free) liquid/lfm-2.5-1.2b-thinking:free | $0/M | $0/M | 33K | ||
| LiquidAI: LFM2.5-1.2B-Instruct (free) liquid/lfm-2.5-1.2b-instruct:free | $0/M | $0/M | 33K | ||
| Venice: Uncensored (free) cognitivecomputations/dolphin-mistral-24b-venice-edition:free | $0/M | $0/M | 33K | ||
| Whisper Large v3 Whisper Large v3 | $0.0036/M | — | — | ||
| Ling-2.6-flash inclusionai/ling-2.6-flash | $0.0100/M | $0.0300/M | 262K | ||
| text-embedding-3-small text-embedding-3-small | $0.0200/M | — | 8K | ||
| Gemma 2 2B Gemma 2 2B | $0.0200/M | $0.0200/M | 8K | ||
| Titan Embeddings V2 Titan Embeddings V2 | $0.0200/M | — | 8K | ||
| text-embedding-004 text-embedding-004 | $0.0250/M | — | 2K | ||
| Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | $0.0200/M | $0.0500/M | 131K | ||
| Qwen2.5-1.5B-Instruct Qwen2.5-1.5B-Instruct | $0.0300/M | $0.0300/M | 32K | ||
| Llama 3.2 1B Llama 3.2 1B | $0.0400/M | $0.0400/M | 128K | ||
| Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | $0.0400/M | $0.0400/M | 8K | ||
| Phi-3-Mini-4K (NIM) Phi-3-Mini-4K (NIM) | $0.0400/M | $0.0400/M | 4K | ||
| SDXL (Rep) SDXL (Rep) | $0.0400/M | — | — | ||
| Llama 3 8B Lunaris sao10k/l3-lunaris-8b | $0.0400/M | $0.0500/M | 8K | ||
| Granite 4.0 Micro ibm-granite/granite-4.0-h-micro | $0.0170/M | $0.1120/M | 131K | ||
| Qwen2.5-3B-Instruct Qwen2.5-3B-Instruct | $0.0500/M | $0.0500/M | 32K | ||
| ABAB 5.5c ABAB 5.5c | $0.0500/M | $0.0500/M | 16K | ||
| Granite 3.1 2B Instruct Granite 3.1 2B Instruct | $0.0300/M | $0.1000/M | 128K | ||
| Gemma 3 4B google/gemma-3-4b-it | $0.0400/M | $0.0800/M | 131K | ||
| Doubao-Lite-128k Doubao-Lite-128k | $0.0400/M | $0.0900/M | 128K | ||
| Doubao-Lite-32k Doubao-Lite-32k | $0.0400/M | $0.0900/M | 32K | ||
| LiquidAI: LFM2-24B-A2B liquid/lfm-2-24b-a2b | $0.0300/M | $0.1200/M | 128K | ||
| Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | $0.0400/M | $0.1000/M | 131K | ||
| Llama 3.1 8B (Groq) Llama 3.1 8B (Groq) | $0.0500/M | $0.0800/M | 128K | ||
| Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | $0.0500/M | $0.0800/M | 33K | ||
| Llama 3.2 3B Llama 3.2 3B | $0.0600/M | $0.0600/M | 128K | ||
| Llama 3.2 3B (Groq) Llama 3.2 3B (Groq) | $0.0600/M | $0.0600/M | 128K | ||
| Gemma 2 9B (DI) Gemma 2 9B (DI) | $0.0600/M | $0.0600/M | 8K | ||
| MythoMax 13B gryphe/mythomax-l2-13b | $0.0600/M | $0.0600/M | 4K | ||
| Granite 4.1 8B ibm-granite/granite-4.1-8b | $0.0500/M | $0.1000/M | 131K | ||
| voyage-3-lite voyage-3-lite | $0.0650/M | — | 32K | ||
| Nova Micro Nova Micro | $0.0350/M | $0.1400/M | 128K | ||
| Nova Micro 1.0 amazon/nova-micro-v1 | $0.0350/M | $0.1400/M | 128K | ||
| Gemma 3 12B google/gemma-3-12b-it | $0.0400/M | $0.1300/M | 131K | ||
| Mistral 7B (DI) Mistral 7B (DI) | $0.0700/M | $0.0700/M | 32K | ||
| Mistral 7B (Lepton) Mistral 7B (Lepton) | $0.0700/M | $0.0700/M | 32K | ||
| Gemini 1.5 Flash-8B Gemini 1.5 Flash-8B | $0.0375/M | $0.1500/M | 1M | ||
| Command R7B Command R7B | $0.0375/M | $0.1500/M | 128K | ||
| Qwen3.5-9B qwen/qwen3.5-9b | $0.0400/M | $0.1500/M | 262K | ||
| Phi-4 Mini Phi-4 Mini | $0.0400/M | $0.1600/M | 128K | ||
| Trinity Mini arcee-ai/trinity-mini | $0.0450/M | $0.1500/M | 131K | ||
| Gemma 3n 4B google/gemma-3n-e4b-it | $0.0600/M | $0.1200/M | 33K | ||
| Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | $0.0270/M | $0.2010/M | 131K | ||
| Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | $0.0710/M | $0.1000/M | 262K | ||
| Mistral 7B v0.3 Mistral 7B v0.3 | $0.0800/M | $0.0800/M | 32K | ||
| Nova Canvas Nova Canvas | $0.0800/M | — | — | ||
| Phi 4 microsoft/phi-4 | $0.0650/M | $0.1400/M | 16K | ||
| Gemma 3 9B Gemma 3 9B | $0.0900/M | $0.0900/M | 128K | ||
| Gemma 2 9B Gemma 2 9B | $0.0900/M | $0.0900/M | 8K | ||
| Ministral 3 3B 2512 mistralai/ministral-3b-2512 | $0.1000/M | $0.1000/M | 131K | ||
| Llama 3.1 8B (fw) Llama 3.1 8B (fw) | $0.1000/M | $0.1000/M | 131K | ||
| Llama 3.1 8B Llama 3.1 8B | $0.1000/M | $0.1000/M | 128K | ||
| Llama 3.1 8B (Together) Llama 3.1 8B (Together) | $0.1000/M | $0.1000/M | 128K | ||
| Llama-3.1-8B (Cerebras) Llama-3.1-8B (Cerebras) | $0.1000/M | $0.1000/M | 128K | ||
| Qwen2.5-7B-Instruct Qwen2.5-7B-Instruct | $0.1000/M | $0.1000/M | 128K | ||
| Qwen2.5-Coder-7B Qwen2.5-Coder-7B | $0.1000/M | $0.1000/M | 128K | ||
| Z.ai: GLM 4 32B z-ai/glm-4-32b | $0.1000/M | $0.1000/M | 128K | ||
| Reka Edge rekaai/reka-edge | $0.1000/M | $0.1000/M | 16K | ||
| StableCode 3B StableCode 3B | $0.1000/M | $0.1000/M | 16K | ||
| mistral-embed mistral-embed | $0.1000/M | — | 8K | ||
| Solar Embedding Solar Embedding | $0.1000/M | — | 4K | ||
| StableLM 2 1.6B StableLM 2 1.6B | $0.1000/M | $0.1000/M | 4K | ||
| Embed v3 English Embed v3 English | $0.1000/M | — | 500 | ||
| Embed v3 Multilingual Embed v3 Multilingual | $0.1000/M | — | 500 | ||
| Hy3 preview tencent/hy3-preview | $0.0630/M | $0.2100/M | 262K | ||
| DeepSeek-R1-Distill-8B DeepSeek-R1-Distill-8B | $0.0700/M | $0.2000/M | 128K | ||
| Granite 3.1 8B Instruct Granite 3.1 8B Instruct | $0.0500/M | $0.2500/M | 128K | ||
| Llama 3 8B (Rep) Llama 3 8B (Rep) | $0.0500/M | $0.2500/M | 8K | ||
| Granite 3.0 8B Dense Granite 3.0 8B Dense | $0.0500/M | $0.2500/M | 4K | ||
| Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | $0.0750/M | $0.2000/M | 128K | ||
| Nova Lite Nova Lite | $0.0600/M | $0.2400/M | 300K | ||
| Nova Lite 1.0 amazon/nova-lite-v1 | $0.0600/M | $0.2400/M | 300K | ||
| voyage-3 voyage-3 | $0.1200/M | — | 32K | ||
| voyage-finance-2 voyage-finance-2 | $0.1200/M | — | 32K | ||
| voyage-law-2 voyage-law-2 | $0.1200/M | — | 32K | ||
| voyage-multilingual-2 voyage-multilingual-2 | $0.1200/M | — | 32K | ||
| Moonshot v1 8k Moonshot v1 8k | $0.1200/M | $0.1200/M | 8K | ||
| Qwen3.5-Flash qwen/qwen3.5-flash-02-23 | $0.0650/M | $0.2600/M | 1M | ||
| Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | $0.0700/M | $0.2700/M | 160K | ||
| UI-TARS 7B bytedance/ui-tars-1.5-7b | $0.1000/M | $0.2000/M | 128K | ||
| Reka Flash 3 rekaai/reka-flash-3 | $0.1000/M | $0.2000/M | 66K | ||
| text-embedding-3-large text-embedding-3-large | $0.1300/M | — | 8K | ||
| ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking | $0.0700/M | $0.2800/M | 131K | ||
| ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | $0.0700/M | $0.2800/M | 131K | ||
| Phi-4 Phi-4 | $0.0700/M | $0.2800/M | 16K | ||
| Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | $0.1100/M | $0.1900/M | 4K | ||
| Qwen3 32B qwen/qwen3-32b | $0.0800/M | $0.2800/M | 131K | ||
| GLM-4-Air GLM-4-Air | $0.1400/M | $0.1400/M | 128K | ||
| Yi-Lightning Yi-Lightning | $0.1400/M | $0.1400/M | 16K | ||
| Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | $0.1400/M | $0.1400/M | 8K | ||
| Qwen3 14B qwen/qwen3-14b | $0.1000/M | $0.2400/M | 132K | ||
| Gemini 2.0 Flash Lite Gemini 2.0 Flash Lite | $0.0750/M | $0.3000/M | 1M | ||
| Gemini 1.5 Flash Gemini 1.5 Flash | $0.0750/M | $0.3000/M | 1M | ||
| Seed 1.6 Flash bytedance-seed/seed-1.6-flash | $0.0750/M | $0.3000/M | 262K | ||
| gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | $0.0750/M | $0.3000/M | 131K | ||
| Ministral 3 8B 2512 mistralai/ministral-8b-2512 | $0.1500/M | $0.1500/M | 262K | ||
| Pixtral 12B Pixtral 12B | $0.1500/M | $0.1500/M | 128K | ||
| Mistral Nemo Mistral Nemo | $0.1500/M | $0.1500/M | 128K | ||
| Mistral-NeMo-12B (NIM) Mistral-NeMo-12B (NIM) | $0.1500/M | $0.1500/M | 128K | ||
| Rnj 1 Instruct essentialai/rnj-1-instruct | $0.1500/M | $0.1500/M | 33K | ||
| Mistral 7B (Anyscale) Mistral 7B (Anyscale) | $0.1500/M | $0.1500/M | 32K | ||
| Yi-6B Yi-6B | $0.1500/M | $0.1500/M | 4K | ||
| Step 3.5 Flash stepfun/step-3.5-flash | $0.0900/M | $0.3000/M | 262K | ||
| Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | $0.0900/M | $0.3000/M | 262K | ||
| GPT-5 Nano openai/gpt-5-nano | $0.0500/M | $0.4000/M | 400K | ||
| Qwen3 8B qwen/qwen3-8b | $0.0500/M | $0.4000/M | 131K | ||
| MiMo-V2-Flash xiaomi/mimo-v2-flash | $0.1000/M | $0.3000/M | 262K | ||
| Devstral Small 1.1 mistralai/devstral-small | $0.1000/M | $0.3000/M | 131K | ||
| Llama 3.2 11B Vision Llama 3.2 11B Vision | $0.1600/M | $0.1600/M | 128K | ||
| Mistral Small 3.2 Mistral Small 3.2 | $0.1000/M | $0.3000/M | 128K | ||
| Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | $0.1000/M | $0.3000/M | 32K | ||
| Phi 4 Mini Instruct microsoft/phi-4-mini-instruct | $0.0800/M | $0.3500/M | 131K | ||
| Doubao-Pro-32k Doubao-Pro-32k | $0.1100/M | $0.2800/M | 32K | ||
| Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash | $0.0600/M | $0.4000/M | 203K | ||
| Titan Text Lite Titan Text Lite | $0.1500/M | $0.2000/M | 4K | ||
| Llama 4 Scout Llama 4 Scout | $0.1000/M | $0.3500/M | 512K | ||
| DeepSeek-R1-Distill-14B DeepSeek-R1-Distill-14B | $0.1000/M | $0.3500/M | 128K | ||
| Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | $0.0800/M | $0.4000/M | 131K | ||
| Llama Guard 4 12B meta-llama/llama-guard-4-12b | $0.1800/M | $0.1800/M | 164K | ||
| Spotlight arcee-ai/spotlight | $0.1800/M | $0.1800/M | 131K | ||
| Llama 3.2 11B Vision (Groq) Llama 3.2 11B Vision (Groq) | $0.1800/M | $0.1800/M | 128K | ||
| voyage-3-large voyage-3-large | $0.1800/M | — | 32K | ||
| voyage-code-3 voyage-code-3 | $0.1800/M | — | 32K | ||
| CodeLlama 7B Instruct CodeLlama 7B Instruct | $0.1800/M | $0.1800/M | 16K | ||
| Llama 2 7B Chat Llama 2 7B Chat | $0.1800/M | $0.1800/M | 4K | ||
| Falcon 7B Instruct Falcon 7B Instruct | $0.1800/M | $0.1800/M | 2K | ||
| MiMo-V2.5 xiaomi/mimo-v2.5 | $0.1400/M | $0.2800/M | 1.0M | ||
| DeepSeek-Coder-V2 DeepSeek-Coder-V2 | $0.1400/M | $0.2800/M | 128K | ||
| Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 | $0.1000/M | $0.4000/M | 1.0M | ||
| Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | $0.1000/M | $0.4000/M | 1.0M | ||
| GPT-4.1 nano GPT-4.1 nano | $0.1000/M | $0.4000/M | 1M | ||
| Gemini 2.0 Flash Gemini 2.0 Flash | $0.1000/M | $0.4000/M | 1M | ||
| Seed-2.0-Mini bytedance-seed/seed-2.0-mini | $0.1000/M | $0.4000/M | 262K | ||
| Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | $0.1000/M | $0.4000/M | 131K | ||
| Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct | $0.1040/M | $0.4160/M | 262K | ||
| Qwen3 30B A3B qwen/qwen3-30b-a3b | $0.0900/M | $0.4500/M | 131K | ||
| Ministral 3 14B 2512 mistralai/ministral-14b-2512 | $0.2000/M | $0.2000/M | 262K | ||
| Mixtral 8x7B (fw) Mixtral 8x7B (fw) | $0.2000/M | $0.2000/M | 32K | ||
| Llama 3 8B Llama 3 8B | $0.2000/M | $0.2000/M | 8K | ||
| Gemma 2 9B (Groq) Gemma 2 9B (Groq) | $0.2000/M | $0.2000/M | 8K | ||
| Falcon 11B Falcon 11B | $0.2000/M | $0.2000/M | 8K | ||
| Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | $0.0800/M | $0.5000/M | 256K | ||
| Nous: Hermes 4 70B nousresearch/hermes-4-70b | $0.1300/M | $0.4000/M | 131K | ||
| CodeLlama 13B Instruct CodeLlama 13B Instruct | $0.2200/M | $0.2200/M | 16K | ||
| Llama 2 13B Chat Llama 2 13B Chat | $0.2200/M | $0.2200/M | 4K | ||
| Ring-2.6-1T inclusionai/ring-2.6-1t | $0.0750/M | $0.6250/M | 262K | ||
| Ling-2.6-1T inclusionai/ling-2.6-1t | $0.0750/M | $0.6250/M | 262K | ||
| Mixtral 8x7B v0.1 Mixtral 8x7B v0.1 | $0.2400/M | $0.2400/M | 32K | ||
| Mixtral 8x7B (Groq) Mixtral 8x7B (Groq) | $0.2400/M | $0.2400/M | 32K | ||
| Mixtral 8x7B (DI) Mixtral 8x7B (DI) | $0.2400/M | $0.2400/M | 32K | ||
| DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | $0.1350/M | $0.5000/M | 131K | ||
| Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | $0.2450/M | $0.2450/M | 131K | ||
| Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | $0.1300/M | $0.5200/M | 262K | ||
| Phi-3.5 Mini Phi-3.5 Mini | $0.1300/M | $0.5200/M | 128K | ||
| Phi-3 Mini Phi-3 Mini | $0.1300/M | $0.5200/M | 128K | ||
| Rocinante 12B thedrummer/rocinante-12b | $0.1700/M | $0.4300/M | 33K | ||
| Codestral Mamba Codestral Mamba | $0.2500/M | $0.2500/M | 256K | ||
| Olmo 3 32B Think allenai/olmo-3-32b-think | $0.1500/M | $0.5000/M | 66K | ||
| Jamba 1.5 Mini Jamba 1.5 Mini | $0.2000/M | $0.4000/M | 256K | ||
| ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | $0.1400/M | $0.5600/M | 131K | ||
| Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | $0.1400/M | $0.5700/M | 131K | ||
| Gemma 3 27B Gemma 3 27B | $0.2700/M | $0.2700/M | 128K | ||
| Gemma 2 27B Gemma 2 27B | $0.2700/M | $0.2700/M | 8K | ||
| Llama 3.3 70B Llama 3.3 70B | $0.2300/M | $0.4000/M | 128K | ||
| Llama 3.3 70B (DI) Llama 3.3 70B (DI) | $0.2300/M | $0.4000/M | 128K | ||
| Gemini 2.5 Flash Gemini 2.5 Flash | $0.1500/M | $0.6000/M | 1M | ||
| Mistral Small 4 mistralai/mistral-small-2603 | $0.1500/M | $0.6000/M | 262K | ||
| GPT-4o mini GPT-4o mini | $0.1500/M | $0.6000/M | 128K | ||
| Command R Command R | $0.1500/M | $0.6000/M | 128K | ||
| Phi-3 Small Phi-3 Small | $0.1500/M | $0.6000/M | 128K | ||
| Solar Pro 3 upstage/solar-pro-3 | $0.1500/M | $0.6000/M | 128K | ||
| GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | $0.1500/M | $0.6000/M | 128K | ||
| GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | $0.1500/M | $0.6000/M | 128K | ||
| Solar Mini Solar Mini | $0.1500/M | $0.6000/M | 32K | ||
| DeepSeek V3.2 deepseek/deepseek-v3.2 | $0.2520/M | $0.3780/M | 131K | ||
| DeepSeek-R1-Distill-32B DeepSeek-R1-Distill-32B | $0.2000/M | $0.5000/M | 128K | ||
| R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | $0.2900/M | $0.2900/M | 128K | ||
| Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | $0.3000/M | $0.3000/M | 131K | ||
| Qwen2-VL-7B Qwen2-VL-7B | $0.3000/M | $0.3000/M | 32K | ||
| Yi-9B Yi-9B | $0.3000/M | $0.3000/M | 4K | ||
| Yi-VL-6B Yi-VL-6B | $0.3000/M | $0.3000/M | 4K | ||
| StableLM 2 12B StableLM 2 12B | $0.3000/M | $0.3000/M | 4K | ||
| Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | $0.0975/M | $0.7800/M | 262K | ||
| Phi-3.5 MoE Phi-3.5 MoE | $0.1600/M | $0.6400/M | 128K | ||
| DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | $0.2700/M | $0.4100/M | 164K | ||
| Qwen3 Coder Next qwen/qwen3-coder-next | $0.1100/M | $0.8000/M | 262K | ||
| Codestral Codestral | $0.2000/M | $0.6000/M | 256K | ||
| Llama 4 Maverick Llama 4 Maverick | $0.2000/M | $0.6000/M | 128K | ||
| Saba mistralai/mistral-saba | $0.2000/M | $0.6000/M | 33K | ||
| Phi-3 Medium Phi-3 Medium | $0.1700/M | $0.6800/M | 128K | ||
| ABAB 5.5s ABAB 5.5s | $0.1500/M | $0.7500/M | 16K | ||
| DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | $0.2870/M | $0.4310/M | 164K | ||
| Llama Guard 3 8B meta-llama/llama-guard-3-8b | $0.4840/M | $0.0300/M | 131K | ||
| Qwen2.5-14B-Instruct Qwen2.5-14B-Instruct | $0.3500/M | $0.3500/M | 128K | ||
| Qwen2.5-Coder-14B Qwen2.5-Coder-14B | $0.3500/M | $0.3500/M | 128K | ||
| CodeLlama 34B Instruct CodeLlama 34B Instruct | $0.3500/M | $0.3500/M | 16K | ||
| Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 | $0.3000/M | $0.5000/M | 131K | ||
| Grok 3 mini Grok 3 mini | $0.3000/M | $0.5000/M | 131K | ||
| Llama 3.1 70B Llama 3.1 70B | $0.3500/M | $0.4000/M | 128K | ||
| Llama 3.1 70B (DI) Llama 3.1 70B (DI) | $0.3500/M | $0.4000/M | 128K | ||
| Qwen2.5-72B (DI) Qwen2.5-72B (DI) | $0.3500/M | $0.4000/M | 128K | ||
| Llama-3.1-Nemotron-70B Llama-3.1-Nemotron-70B | $0.3500/M | $0.4000/M | 128K | ||
| DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | $0.2000/M | $0.7700/M | 164K | ||
| Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | $0.3600/M | $0.4000/M | 131K | ||
| Yi-Large-Turbo Yi-Large-Turbo | $0.3800/M | $0.3800/M | 16K | ||
| DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | $0.2100/M | $0.7900/M | 164K | ||
| Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b | $0.1390/M | $1.00/M | 262K | ||
| Qwen3.6 35B A3B qwen/qwen3.6-35b-a3b | $0.1400/M | $1.00/M | 262K | ||
| Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | $0.2500/M | $0.7500/M | 131K | ||
| Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | $0.4000/M | $0.4000/M | 131K | ||
| Llama 3.1 70B (Hyp) Llama 3.1 70B (Hyp) | $0.4000/M | $0.4000/M | 128K | ||
| Hermes-3-70B (Hyp) Hermes-3-70B (Hyp) | $0.4000/M | $0.4000/M | 128K | ||
| Qwen2.5-72B (Hyp) Qwen2.5-72B (Hyp) | $0.4000/M | $0.4000/M | 128K | ||
| Inception: Mercury 2 inception/mercury-2 | $0.2500/M | $0.7500/M | 128K | ||
| UnslopNemo 12B thedrummer/unslopnemo-12b | $0.4000/M | $0.4000/M | 33K | ||
| Moonshot v1 32k Moonshot v1 32k | $0.4000/M | $0.4000/M | 32K | ||
| Granite 20B Multilingual Granite 20B Multilingual | $0.4000/M | $0.4000/M | 8K | ||
| Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | $0.2000/M | $0.8800/M | 262K | ||
| Trinity Large Thinking arcee-ai/trinity-large-thinking | $0.2200/M | $0.8500/M | 262K | ||
| Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | $0.3510/M | $0.5550/M | 128K | ||
| Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking | $0.2600/M | $0.7800/M | 1M | ||
| Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | $0.2600/M | $0.7800/M | 1M | ||
| Qwen-Plus qwen/qwen-plus | $0.2600/M | $0.7800/M | 1M | ||
| Qwen3 Coder Flash qwen/qwen3-coder-flash | $0.1950/M | $0.9750/M | 1M | ||
| DeepSeek V3 deepseek/deepseek-chat | $0.2288/M | $0.9144/M | 131K | ||
| ABAB 6.5t ABAB 6.5t | $0.4500/M | $0.4500/M | 8K | ||
| Hunyuan-Standard Hunyuan-Standard | $0.3500/M | $0.7000/M | 256K | ||
| Qwen3.6 Flash qwen/qwen3.6-flash | $0.1875/M | $1.13/M | 1M | ||
| MiniMax-01 minimax/minimax-01 | $0.2000/M | $1.10/M | 1.0M | ||
| INTELLECT-3 prime-intellect/intellect-3 | $0.2000/M | $1.10/M | 131K | ||
| DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | $0.2700/M | $0.9500/M | 164K | ||
| MiniMax M2 minimax/minimax-m2 | $0.2550/M | $1.00/M | 205K | ||
| Codestral 2508 mistralai/codestral-2508 | $0.3000/M | $0.9000/M | 256K | ||
| Z.ai: GLM 4.6V z-ai/glm-4.6v | $0.3000/M | $0.9000/M | 131K | ||
| MiniMax M2.1 minimax/minimax-m2.1 | $0.2900/M | $0.9500/M | 205K | ||
| Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking | $0.1170/M | $1.36/M | 256K | ||
| Mixtral 8x7B (Anyscale) Mixtral 8x7B (Anyscale) | $0.5000/M | $0.5000/M | 32K | ||
| Mixtral 8x7B (Lepton) Mixtral 8x7B (Lepton) | $0.5000/M | $0.5000/M | 32K | ||
| DeepSeek-R1-Distill-70B DeepSeek-R1-Distill-70B | $0.3500/M | $0.8800/M | 128K | ||
| Mixtral 8x7B (Rep) Mixtral 8x7B (Rep) | $0.3000/M | $1.00/M | 32K | ||
| ReMM SLERP 13B undi95/remm-slerp-l2-13b | $0.4500/M | $0.6500/M | 6K | ||
| GPT-5.4 Nano openai/gpt-5.4-nano | $0.2000/M | $1.25/M | 400K | ||
| DeepSeek-V3-0324 DeepSeek-V3-0324 | $0.2700/M | $1.10/M | 128K | ||
| DeepSeek-V3 DeepSeek-V3 | $0.2700/M | $1.10/M | 64K | ||
| ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | $0.2800/M | $1.10/M | 131K | ||
| Claude 3 Haiku anthropic/claude-3-haiku | $0.2500/M | $1.25/M | 200K | ||
| Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | $0.1495/M | $1.50/M | 262K | ||
| Perceptron Mk1 perceptron/perceptron-mk1 | $0.1500/M | $1.50/M | 33K | ||
| MiniMax M2.7 minimax/minimax-m2.7 | $0.2790/M | $1.20/M | 205K | ||
| Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking | $0.1300/M | $1.56/M | 131K | ||
| Jamba Instruct Jamba Instruct | $0.5000/M | $0.7000/M | 256K | ||
| Granite Code 34B Granite Code 34B | $0.3500/M | $1.05/M | 8K | ||
| DeepSeek V4 Pro deepseek/deepseek-v4-pro | $0.4350/M | $0.8700/M | 1.0M | ||
| MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro | $0.4350/M | $0.8700/M | 1.0M | ||
| KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 | $0.3000/M | $1.20/M | 256K | ||
| MiniMax M2-her minimax/minimax-m2-her | $0.3000/M | $1.20/M | 66K | ||
| Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | $0.5100/M | $0.7400/M | 8K | ||
| Coder Large arcee-ai/coder-large | $0.5000/M | $0.8000/M | 33K | ||
| Phind-CodeLlama-34B (DI) Phind-CodeLlama-34B (DI) | $0.6000/M | $0.6000/M | 16K | ||
| Yi-34B-Chat (DI) Yi-34B-Chat (DI) | $0.6000/M | $0.6000/M | 4K | ||
| Qwen3.5-27B qwen/qwen3.5-27b | $0.1950/M | $1.56/M | 262K | ||
| WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | $0.6200/M | $0.6200/M | 66K | ||
| Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite | $0.2500/M | $1.50/M | 1.0M | ||
| Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | $0.2500/M | $1.50/M | 1.0M | ||
| Skyfall 36B V2 thedrummer/skyfall-36b-v2 | $0.5500/M | $0.8000/M | 33K | ||
| WizardLM-2 8x22B (DI) WizardLM-2 8x22B (DI) | $0.6300/M | $0.6300/M | 64K | ||
| Llama 3.3 70B (Groq) Llama 3.3 70B (Groq) | $0.5900/M | $0.7900/M | 128K | ||
| Llama 3.1 70B (Groq) Llama 3.1 70B (Groq) | $0.5900/M | $0.7900/M | 128K | ||
| Qwen2-57B-A14B Qwen2-57B-A14B | $0.6500/M | $0.6500/M | 64K | ||
| Llama 3 70B Llama 3 70B | $0.5900/M | $0.7900/M | 8K | ||
| ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | $0.4200/M | $1.25/M | 131K | ||
| Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | $0.6500/M | $0.7500/M | 131K | ||
| DeepSeek-R1 (Cerebras) DeepSeek-R1 (Cerebras) | $0.5500/M | $0.9900/M | 64K | ||
| GLM-4-Long GLM-4-Long | $0.7000/M | $0.7000/M | 1M | ||
| Qwen2.5-32B-Instruct Qwen2.5-32B-Instruct | $0.7000/M | $0.7000/M | 128K | ||
| Qwen2.5-Coder-32B Qwen2.5-Coder-32B | $0.7000/M | $0.7000/M | 128K | ||
| Llama-3.3-70B (Cerebras) Llama-3.3-70B (Cerebras) | $0.5900/M | $0.9900/M | 128K | ||
| ERNIE 3.5 8K ERNIE 3.5 8K | $0.5500/M | $1.10/M | 8K | ||
| R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | $0.7000/M | $0.8000/M | 131K | ||
| Qwen3.5 Plus 2026-04-20 qwen/qwen3.5-plus-20260420 | $0.3000/M | $1.80/M | 1M | ||
| GPT-4.1 mini GPT-4.1 mini | $0.4000/M | $1.60/M | 1M | ||
| Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | $0.6600/M | $1.00/M | 128K | ||
| GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | $0.2500/M | $2.00/M | 400K | ||
| GPT-5 Mini openai/gpt-5-mini | $0.2500/M | $2.00/M | 400K | ||
| Seed-2.0-Lite bytedance-seed/seed-2.0-lite | $0.2500/M | $2.00/M | 262K | ||
| Seed 1.6 bytedance-seed/seed-1.6 | $0.2500/M | $2.00/M | 262K | ||
| Qwen2.5-72B (Groq) Qwen2.5-72B (Groq) | $0.7900/M | $0.7900/M | 128K | ||
| Qwen2.5-Coder-32B (Groq) Qwen2.5-Coder-32B (Groq) | $0.7900/M | $0.7900/M | 128K | ||
| Mistral Large 3 2512 mistralai/mistral-large-2512 | $0.5000/M | $1.50/M | 262K | ||
| Llama 3.1 405B Llama 3.1 405B | $0.8000/M | $0.8000/M | 128K | ||
| Aya Expanse 32B Aya Expanse 32B | $0.5000/M | $1.50/M | 128K | ||
| Llama 3.1 405B (DI) Llama 3.1 405B (DI) | $0.8000/M | $0.8000/M | 128K | ||
| Llama 3.1 70B (Lepton) Llama 3.1 70B (Lepton) | $0.8000/M | $0.8000/M | 128K | ||
| Titan Text Premier Titan Text Premier | $0.5000/M | $1.50/M | 32K | ||
| Phind-CodeLlama-34B (fw) Phind-CodeLlama-34B (fw) | $0.8000/M | $0.8000/M | 16K | ||
| Aya Expanse 8B Aya Expanse 8B | $0.5000/M | $1.50/M | 8K | ||
| Nous-Hermes-2-Yi-34B Nous-Hermes-2-Yi-34B | $0.8000/M | $0.8000/M | 4K | ||
| Yi-34B-Chat Yi-34B-Chat | $0.8000/M | $0.8000/M | 4K | ||
| Yi-34B Yi-34B | $0.8000/M | $0.8000/M | 4K | ||
| Z.ai: GLM 4.7 z-ai/glm-4.7 | $0.4000/M | $1.75/M | 203K | ||
| Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b | $0.2600/M | $2.08/M | 262K | ||
| Doubao-Pro-256k Doubao-Pro-256k | $0.5600/M | $1.40/M | 256K | ||
| Doubao-Pro-128k Doubao-Pro-128k | $0.5600/M | $1.40/M | 128K | ||
| Qwen3.6 Plus qwen/qwen3.6-plus | $0.3250/M | $1.95/M | 1M | ||
| DeepSeek-R1 (Groq) DeepSeek-R1 (Groq) | $0.7500/M | $0.9900/M | 128K | ||
| Z.ai: GLM 4.6 z-ai/glm-4.6 | $0.4300/M | $1.74/M | 203K | ||
| Weaver (alpha) mancer/weaver | $0.7500/M | $1.00/M | 8K | ||
| MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5 | $0.4000/M | $1.90/M | 262K | ||
| Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | $0.8500/M | $0.8500/M | 131K | ||
| Qwen3 235B A22B qwen/qwen3-235b-a22b | $0.4550/M | $1.82/M | 131K | ||
| MiMo-V2-Omni xiaomi/mimo-v2-omni | $0.4000/M | $2.00/M | 262K | ||
| Devstral 2 2512 mistralai/devstral-2512 | $0.4000/M | $2.00/M | 262K | ||
| Mistral Medium 3.1 mistralai/mistral-medium-3.1 | $0.4000/M | $2.00/M | 131K | ||
| Devstral Medium mistralai/devstral-medium | $0.4000/M | $2.00/M | 131K | ||
| Mistral Medium 3 mistralai/mistral-medium-3 | $0.4000/M | $2.00/M | 131K | ||
| Reka Edge Reka Edge | $0.4000/M | $2.00/M | 128K | ||
| Llama 3.2 90B Vision Llama 3.2 90B Vision | $0.8800/M | $0.8800/M | 128K | ||
| Llama 3.3 70B (Together) Llama 3.3 70B (Together) | $0.8800/M | $0.8800/M | 128K | ||
| Virtuoso Large arcee-ai/virtuoso-large | $0.7500/M | $1.20/M | 131K | ||
| Qwen2-72B-Instruct Qwen2-72B-Instruct | $0.9000/M | $0.9000/M | 128K | ||
| Mixtral 8x22B Mixtral 8x22B | $0.9000/M | $0.9000/M | 64K | ||
| Qwen2.5-72B (fw) Qwen2.5-72B (fw) | $0.9000/M | $0.9000/M | 32K | ||
| CodeLlama 70B Instruct CodeLlama 70B Instruct | $0.9000/M | $0.9000/M | 16K | ||
| Llama 2 70B Chat Llama 2 70B Chat | $0.9000/M | $0.9000/M | 4K | ||
| Yi-34B (fw) Yi-34B (fw) | $0.9000/M | $0.9000/M | 4K | ||
| Falcon 40B (Together) Falcon 40B (Together) | $0.9000/M | $0.9000/M | 2K | ||
| Falcon 40B Instruct Falcon 40B Instruct | $0.9000/M | $0.9000/M | 2K | ||
| Hunyuan-Standard-256K Hunyuan-Standard-256K | $0.7000/M | $1.40/M | 256K | ||
| AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | $0.7000/M | $1.40/M | 131K | ||
| Morph V3 Fast morph/morph-v3-fast | $0.8000/M | $1.20/M | 82K | ||
| CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | $0.8000/M | $1.20/M | 4K | ||
| MiniMax M1 minimax/minimax-m1 | $0.4000/M | $2.20/M | 1M | ||
| Palmyra X 004 Palmyra X 004 | $0.5000/M | $2.00/M | 128K | ||
| Nova 2 Lite amazon/nova-2-lite-v1 | $0.3000/M | $2.50/M | 1M | ||
| Z.ai: GLM 4.5V z-ai/glm-4.5v | $0.6000/M | $1.80/M | 66K | ||
| Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image | $0.3000/M | $2.50/M | 33K | ||
| Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | $0.2600/M | $2.60/M | 131K | ||
| Relace Apply 3 relace/relace-apply-3 | $0.8500/M | $1.25/M | 256K | ||
| Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | $0.3900/M | $2.34/M | 262K | ||
| Llama-3.1-405B (Cerebras) Llama-3.1-405B (Cerebras) | $0.9900/M | $0.9900/M | 128K | ||
| R1 0528 deepseek/deepseek-r1-0528 | $0.5000/M | $2.15/M | 164K | ||
| Z.ai: GLM 5 z-ai/glm-5 | $0.6000/M | $1.92/M | 203K | ||
| Llama 3.1 70B (Anyscale) Llama 3.1 70B (Anyscale) | $1.00/M | $1.00/M | 128K | ||
| Sonar Sonar | $1.00/M | $1.00/M | 127K | ||
| CodeLlama 70B (Anyscale) CodeLlama 70B (Anyscale) | $1.00/M | $1.00/M | 16K | ||
| AionLabs: Aion-2.0 aion-labs/aion-2.0 | $0.8000/M | $1.60/M | 131K | ||
| AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | $0.8000/M | $1.60/M | 33K | ||
| DeepSeek-R1 DeepSeek-R1 | $0.5500/M | $2.19/M | 64K | ||
| DeepSeek-R1-0528 DeepSeek-R1-0528 | $0.5500/M | $2.19/M | 64K | ||
| DeepSeek-R1 (DI) DeepSeek-R1 (DI) | $0.5500/M | $2.19/M | 64K | ||
| Z.ai: GLM 4.5 z-ai/glm-4.5 | $0.6000/M | $2.20/M | 131K | ||
| MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | $0.5700/M | $2.30/M | 131K | ||
| QwQ-32B QwQ-32B | $0.6000/M | $2.40/M | 131K | ||
| GPT Audio Mini openai/gpt-audio-mini | $0.6000/M | $2.40/M | 128K | ||
| Qwen3.6 27B qwen/qwen3.6-27b | $0.2900/M | $3.20/M | 262K | ||
| MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking | $0.6000/M | $2.50/M | 262K | ||
| MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | $0.6000/M | $2.50/M | 262K | ||
| DBRX Instruct DBRX Instruct | $0.7500/M | $2.25/M | 32K | ||
| DBRX Base DBRX Base | $0.7500/M | $2.25/M | 32K | ||
| Morph V3 Large morph/morph-v3-large | $0.9000/M | $1.90/M | 262K | ||
| Qwen2.5-72B-Instruct Qwen2.5-72B-Instruct | $1.20/M | $1.20/M | 128K | ||
| Mixtral 8x22B (Together) Mixtral 8x22B (Together) | $1.20/M | $1.20/M | 64K | ||
| WizardLM-2 8x22B WizardLM-2 8x22B | $1.20/M | $1.20/M | 64K | ||
| Qwen2.5-72B (Together) Qwen2.5-72B (Together) | $1.20/M | $1.20/M | 32K | ||
| DBRX Instruct (Together) DBRX Instruct (Together) | $1.20/M | $1.20/M | 32K | ||
| R1 deepseek/deepseek-r1 | $0.7000/M | $2.50/M | 164K | ||
| Gemini 3 Flash Preview google/gemini-3-flash-preview | $0.5000/M | $3.00/M | 1.0M | ||
| Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview | $0.5000/M | $3.00/M | 131K | ||
| Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b | $1.25/M | $1.25/M | 128K | ||
| Grok Build 0.1 x-ai/grok-build-0.1 | $1.00/M | $2.00/M | 256K | ||
| GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | $1.00/M | $2.00/M | 4K | ||
| Qianfan-OCR-Fast baidu/qianfan-ocr-fast | $0.6800/M | $2.81/M | 66K | ||
| MiniMax-Text-01 MiniMax-Text-01 | $0.7000/M | $2.80/M | 1M | ||
| Qwen3 Coder Plus qwen/qwen3-coder-plus | $0.6500/M | $3.25/M | 1M | ||
| Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | $1.48/M | $1.48/M | 8K | ||
| Nova Pro Nova Pro | $0.8000/M | $3.20/M | 300K | ||
| Nova Pro 1.0 amazon/nova-pro-v1 | $0.8000/M | $3.20/M | 300K | ||
| MiMo-V2-Pro xiaomi/mimo-v2-pro | $1.00/M | $3.00/M | 1.0M | ||
| Relace Search relace/relace-search | $1.00/M | $3.00/M | 256K | ||
| Nous: Hermes 4 405B nousresearch/hermes-4-405b | $1.00/M | $3.00/M | 131K | ||
| Llama-3.1-Nemotron-Ultra-253B Llama-3.1-Nemotron-Ultra-253B | $1.60/M | $1.60/M | 128K | ||
| Llama 3.1 70B (DB) Llama 3.1 70B (DB) | $1.00/M | $3.00/M | 128K | ||
| Palmyra X 32k Palmyra X 32k | $1.00/M | $3.00/M | 32K | ||
| Palmyra Med Palmyra Med | $1.00/M | $3.00/M | 32K | ||
| Palmyra Fin Palmyra Fin | $1.00/M | $3.00/M | 32K | ||
| Z.ai: GLM 5.1 z-ai/glm-5.1 | $0.9800/M | $3.08/M | 203K | ||
| Switchpoint Router switchpoint/router | $0.8500/M | $3.40/M | 131K | ||
| Maestro Reasoning arcee-ai/maestro-reasoning | $0.9000/M | $3.30/M | 131K | ||
| Grok 4.20 x-ai/grok-4.20 | $1.25/M | $2.50/M | 2M | ||
| Grok 4.3 x-ai/grok-4.3 | $1.25/M | $2.50/M | 1M | ||
| Qwen3 Max Thinking qwen/qwen3-max-thinking | $0.7800/M | $3.90/M | 262K | ||
| Qwen3 Max qwen/qwen3-max | $0.7800/M | $3.90/M | 262K | ||
| Claude Haiku 4.5 Claude Haiku 4.5 | $0.8000/M | $4.00/M | 200K | ||
| Claude Haiku 3.5 Claude Haiku 3.5 | $0.8000/M | $4.00/M | 200K | ||
| Claude 3.5 Haiku anthropic/claude-3.5-haiku | $0.8000/M | $4.00/M | 200K | ||
| Reka Flash 21B Reka Flash 21B | $0.8000/M | $4.00/M | 128K | ||
| Moonshot v1 128k Moonshot v1 128k | $1.80/M | $1.80/M | 128K | ||
| GPT-5.4 Mini openai/gpt-5.4-mini | $0.7500/M | $4.50/M | 400K | ||
| Qwen3.7 Max qwen/qwen3.7-max | $1.25/M | $3.75/M | 1M | ||
| Qwen2-VL-72B Qwen2-VL-72B | $2.00/M | $2.00/M | 32K | ||
| Rerank v3.5 Rerank v3.5 | $2.00/M | — | — | ||
| Rerank v3 Multilingual Rerank v3 Multilingual | $2.00/M | — | — | ||
| Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo | $1.20/M | $4.00/M | 203K | ||
| Z.ai: GLM 5 Turbo z-ai/glm-5-turbo | $1.20/M | $4.00/M | 203K | ||
| o3-mini o3-mini | $1.10/M | $4.40/M | 200K | ||
| o4-mini o4-mini | $1.10/M | $4.40/M | 200K | ||
| o4 Mini High openai/o4-mini-high | $1.10/M | $4.40/M | 200K | ||
| o4 Mini openai/o4-mini | $1.10/M | $4.40/M | 200K | ||
| o3 Mini High openai/o3-mini-high | $1.10/M | $4.40/M | 200K | ||
| o3 Mini openai/o3-mini | $1.10/M | $4.40/M | 200K | ||
| Palmyra Vision Palmyra Vision | $1.50/M | $3.50/M | 32K | ||
| Sonar Reasoning Sonar Reasoning | $1.00/M | $5.00/M | 127K | ||
| ABAB 6.5s ABAB 6.5s | $1.00/M | $5.00/M | 8K | ||
| Palmyra X5 writer/palmyra-x5 | $0.6000/M | $6.00/M | 1.0M | ||
| GPT-5 Image Mini openai/gpt-5-image-mini | $2.50/M | $2.00/M | 400K | ||
| Gemini 1.5 Pro Gemini 1.5 Pro | $1.25/M | $5.00/M | 2M | ||
| Qwen3.6 Max Preview qwen/qwen3.6-max-preview | $1.04/M | $6.24/M | 262K | ||
| Solar Pro Solar Pro | $1.50/M | $6.00/M | 32K | ||
| Llama 3.1 405B (Groq) Llama 3.1 405B (Groq) | $2.99/M | $2.99/M | 128K | ||
| Llama 3.1 405B (fw) Llama 3.1 405B (fw) | $3.00/M | $3.00/M | 131K | ||
| Yi-Large Yi-Large | $3.00/M | $3.00/M | 32K | ||
| Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 | $3.00/M | $3.00/M | 16K | ||
| Qwen2.5-Max Qwen2.5-Max | $1.60/M | $6.40/M | 32K | ||
| Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent | $2.00/M | $6.00/M | 2M | ||
| Mistral Large 2411 mistralai/mistral-large-2411 | $2.00/M | $6.00/M | 131K | ||
| Mistral Large 2407 mistralai/mistral-large-2407 | $2.00/M | $6.00/M | 131K | ||
| Pixtral Large 2411 mistralai/pixtral-large-2411 | $2.00/M | $6.00/M | 131K | ||
| Mistral Large 2 Mistral Large 2 | $2.00/M | $6.00/M | 128K | ||
| Pixtral Large Pixtral Large | $2.00/M | $6.00/M | 128K | ||
| Mistral-Large-2 (NIM) Mistral-Large-2 (NIM) | $2.00/M | $6.00/M | 128K | ||
| Mistral Large mistralai/mistral-large | $2.00/M | $6.00/M | 128K | ||
| Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | $2.00/M | $6.00/M | 66K | ||
| Mistral Medium 3.5 mistralai/mistral-medium-3-5 | $1.50/M | $7.50/M | 262K | ||
| GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | $3.00/M | $4.00/M | 16K | ||
| Llama 3.1 405B (Together) Llama 3.1 405B (Together) | $3.50/M | $3.50/M | 128K | ||
| Yi-VL-34B Yi-VL-34B | $3.50/M | $3.50/M | 4K | ||
| Magnum v4 72B anthracite-org/magnum-v4-72b | $3.00/M | $5.00/M | 33K | ||
| Gemini 3.5 Flash google/gemini-3.5-flash | $1.50/M | $9.00/M | 1.0M | ||
| GPT-4.1 GPT-4.1 | $2.00/M | $8.00/M | 1M | ||
| Jamba 1.5 Large Jamba 1.5 Large | $2.00/M | $8.00/M | 256K | ||
| Jamba Large 1.7 ai21/jamba-large-1.7 | $2.00/M | $8.00/M | 256K | ||
| o4 Mini Deep Research openai/o4-mini-deep-research | $2.00/M | $8.00/M | 200K | ||
| Sonar Reasoning Pro Sonar Reasoning Pro | $2.00/M | $8.00/M | 127K | ||
| Sonar Deep Research Sonar Deep Research | $2.00/M | $8.00/M | 127K | ||
| Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | $1.25/M | $10.00/M | 1.0M | ||
| Gemini 2.5 Pro Gemini 2.5 Pro | $1.25/M | $10.00/M | 1M | ||
| GPT-5 GPT-5 | $1.25/M | $10.00/M | 400K | ||
| GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | $1.25/M | $10.00/M | 400K | ||
| GPT-5.1 openai/gpt-5.1 | $1.25/M | $10.00/M | 400K | ||
| GPT-5.1-Codex openai/gpt-5.1-codex | $1.25/M | $10.00/M | 400K | ||
| GPT-5 Codex openai/gpt-5-codex | $1.25/M | $10.00/M | 400K | ||
| GPT-5.1 Chat openai/gpt-5.1-chat | $1.25/M | $10.00/M | 128K | ||
| GPT-5 Chat openai/gpt-5-chat | $1.25/M | $10.00/M | 128K | ||
| Llama 3.1 405B (Hyp) Llama 3.1 405B (Hyp) | $4.00/M | $4.00/M | 128K | ||
| DeepSeek-R1 (Together) DeepSeek-R1 (Together) | $3.00/M | $7.00/M | 64K | ||
| Nemotron-4-340B Nemotron-4-340B | $4.20/M | $4.20/M | 4K | ||
| Grok 2 Grok 2 | $2.00/M | $10.00/M | 131K | ||
| Grok 2 Vision Grok 2 Vision | $2.00/M | $10.00/M | 32K | ||
| DeepSeek-R1 (fw) DeepSeek-R1 (fw) | $3.00/M | $8.00/M | 64K | ||
| Hunyuan-Turbo Hunyuan-Turbo | $3.50/M | $7.00/M | 256K | ||
| Command A Command A | $2.50/M | $10.00/M | 256K | ||
| GPT-4o GPT-4o | $2.50/M | $10.00/M | 128K | ||
| Command R+ Command R+ | $2.50/M | $10.00/M | 128K | ||
| Kimi k1.5 Kimi k1.5 | $2.50/M | $10.00/M | 128K | ||
| GPT Audio openai/gpt-audio | $2.50/M | $10.00/M | 128K | ||
| GPT-4o Audio openai/gpt-4o-audio-preview | $2.50/M | $10.00/M | 128K | ||
| GPT-4o Search Preview openai/gpt-4o-search-preview | $2.50/M | $10.00/M | 128K | ||
| Pi (Inflection-2.5) Pi (Inflection-2.5) | $2.50/M | $10.00/M | 32K | ||
| Inflection 3 Pi inflection/inflection-3-pi | $2.50/M | $10.00/M | 8K | ||
| Inflection 3 Productivity inflection/inflection-3-productivity | $2.50/M | $10.00/M | 8K | ||
| Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools | $2.00/M | $12.00/M | 1.0M | ||
| Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | $2.00/M | $12.00/M | 1.0M | ||
| Sonar Huge Sonar Huge | $5.00/M | $5.00/M | 127K | ||
| Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview | $2.00/M | $12.00/M | 66K | ||
| AionLabs: Aion-1.0 aion-labs/aion-1.0 | $4.00/M | $8.00/M | 131K | ||
| GPT-5.3-Codex openai/gpt-5.3-codex | $1.75/M | $14.00/M | 400K | ||
| GPT-5.2-Codex openai/gpt-5.2-codex | $1.75/M | $14.00/M | 400K | ||
| GPT-5.2 openai/gpt-5.2 | $1.75/M | $14.00/M | 400K | ||
| GPT-5.3 Chat openai/gpt-5.3-chat | $1.75/M | $14.00/M | 128K | ||
| GPT-5.2 Chat openai/gpt-5.2-chat | $1.75/M | $14.00/M | 128K | ||
| Falcon 180B Falcon 180B | $3.50/M | $10.00/M | 4K | ||
| Nova Premier 1.0 amazon/nova-premier-v1 | $2.50/M | $12.50/M | 1M | ||
| Nova Premier Nova Premier | $2.50/M | $12.50/M | 300K | ||
| Hunyuan-Vision Hunyuan-Vision | $5.50/M | $5.50/M | 4K | ||
| ERNIE 4.0 Turbo 8K ERNIE 4.0 Turbo 8K | $3.50/M | $10.50/M | 8K | ||
| GPT-5.4 openai/gpt-5.4 | $2.50/M | $15.00/M | 1.1M | ||
| Claude Sonnet 4.6 Claude Sonnet 4.6 | $3.00/M | $15.00/M | 1M | ||
| Claude Sonnet 4 anthropic/claude-sonnet-4 | $3.00/M | $15.00/M | 1M | ||
| Claude Sonnet 4.5 Claude Sonnet 4.5 | $3.00/M | $15.00/M | 200K | ||
| Claude Sonnet 3.7 Claude Sonnet 3.7 | $3.00/M | $15.00/M | 200K | ||
| Claude Sonnet 3.5 Claude Sonnet 3.5 | $3.00/M | $15.00/M | 200K | ||
| Sonar Pro Sonar Pro | $3.00/M | $15.00/M | 200K | ||
| Sonar Pro Search perplexity/sonar-pro-search | $3.00/M | $15.00/M | 200K | ||
| Grok 3 Grok 3 | $3.00/M | $15.00/M | 131K | ||
| Reka Core Reka Core | $3.00/M | $15.00/M | 128K | ||
| ERNIE 4.0 8K ERNIE 4.0 8K | $4.20/M | $12.60/M | 8K | ||
| GLM-4-Plus GLM-4-Plus | $7.00/M | $7.00/M | 128K | ||
| GLM-4V GLM-4V | $7.00/M | $7.00/M | 2K | ||
| Llama 3.1 405B (Rep) Llama 3.1 405B (Rep) | $9.50/M | $9.50/M | 128K | ||
| GPT-5 Image openai/gpt-5-image | $10.00/M | $10.00/M | 400K | ||
| GPT-5.4 Image 2 openai/gpt-5.4-image-2 | $8.00/M | $15.00/M | 272K | ||
| Claude Opus 4.6 Claude Opus 4.6 | $5.00/M | $25.00/M | 1M | ||
| Claude Opus 4.7 anthropic/claude-opus-4.7 | $5.00/M | $25.00/M | 1M | ||
| Grok 3 Fast Grok 3 Fast | $5.00/M | $25.00/M | 131K | ||
| GPT-5.5 openai/gpt-5.5 | $5.00/M | $30.00/M | 1.1M | ||
| GPT Chat Latest openai/gpt-chat-latest | $5.00/M | $30.00/M | 400K | ||
| GPT-4 Turbo GPT-4 Turbo | $10.00/M | $30.00/M | 128K | ||
| GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | $10.00/M | $30.00/M | 128K | ||
| o3 o3 | $10.00/M | $40.00/M | 200K | ||
| o3 Deep Research openai/o3-deep-research | $10.00/M | $40.00/M | 200K | ||
| o1 o1 | $15.00/M | $60.00/M | 200K | ||
| Claude Opus 4.5 Claude Opus 4.5 | $15.00/M | $75.00/M | 200K | ||
| Claude Opus 4.1 anthropic/claude-opus-4.1 | $15.00/M | $75.00/M | 200K | ||
| Claude Opus 4 anthropic/claude-opus-4 | $15.00/M | $75.00/M | 200K | ||
| o3 Pro openai/o3-pro | $20.00/M | $80.00/M | 200K | ||
| GPT-4 (older v0314) openai/gpt-4-0314 | $30.00/M | $60.00/M | 8K | ||
| GPT-5 Pro openai/gpt-5-pro | $15.00/M | $120.00/M | 400K | ||
| GPT-5.2 Pro openai/gpt-5.2-pro | $21.00/M | $168.00/M | 400K | ||
| Claude Opus 4.7 (Fast) anthropic/claude-opus-4.7-fast | $30.00/M | $150.00/M | 1M | ||
| Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast | $30.00/M | $150.00/M | 1M | ||
| GPT-5.5 Pro openai/gpt-5.5-pro | $30.00/M | $180.00/M | 1.1M | ||
| GPT-5.4 Pro openai/gpt-5.4-pro | $30.00/M | $180.00/M | 1.1M | ||
| o1-pro openai/o1-pro | $150.00/M | $600.00/M | 200K |