# BestToken: Complete Vendor Pricing Data

> Data updated: 2026-05-15
> Currency: USD

---

## SECTION 1: TEXT/CODE LLM PRICING

Prices are in USD per million tokens ($/M).

### AI21 Labs

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Jamba Large 1.7 | Official API | 2 | 8 | - | - | - |
| Jamba Large 1.7 | OpenRouter | 2 | 8 | - | - | - |
| Jamba Large 1.7 | AWS Bedrock | 2 | 8 | - | - | - |
| Jamba Mini 1.7 | Official API | 0.2 | 0.4 | - | - | - |
| Jamba Mini 1.7 | OpenRouter | 0.2 | 0.4 | - | - | - |

### Alibaba

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Qwen3.6-Plus | Together AI | 0.5 | 3 | - | - | - |
| Qwen3.5 397B A17B | Together AI | 0.6 | 3.6 | - | - | - |
| Qwen3-Coder-Next | Together AI | 0.5 | 1.2 | - | - | - |
| Qwen3-Coder 480B A35B | Together AI | 2 | 2 | - | - | - |
| Qwen3-Coder 480B A35B | DeepInfra | 0.3 | 1 | 0.1 | - | - |
| Qwen3-235B-A22B | Official API | 0.5 | 2 | - | - | - |
| Qwen3-235B-A22B | Together AI | 0.2 | 0.6 | - | - | - |
| Qwen3-235B-A22B | Fireworks AI | 0.2 | 0.6 | - | - | - |
| Qwen3-235B-A22B | SiliconFlow | 0.5 | 2 | - | - | - |
| Qwen3-235B-A22B | DeepInfra | 0.071 | 0.1 | - | - | - |
| Qwen3-235B-A22B | Cerebras | 0.6 | 1.2 | - | - | - |
| Qwen3-32B | Official API | 0.2 | 0.6 | - | - | - |
| Qwen3-32B | Together AI | 0.2 | 0.6 | - | - | - |
| Qwen3-32B | Groq | 0.29 | 0.59 | - | - | - |
| Qwen3-32B | SiliconFlow | 0.2 | 0.6 | - | - | - |
| Qwen3-32B | SambaNova | 0.4 | 0.8 | - | - | - |
| QwQ-32B | Official API | 0.12 | 0.36 | - | - | - |
| QwQ-32B | SiliconFlow | 0.12 | 0.36 | - | - | - |

### Anthropic

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Claude Opus 4.6 | Official API | 5 | 25 | 0.5 | - | - |
| Claude Opus 4.6 | AWS Bedrock | 5 | 25 | 0.5 | - | - |
| Claude Opus 4.6 | Google Vertex AI | 5 | 25 | - | - | - |
| Claude Opus 4.6 | OpenRouter | 5 | 25 | - | - | - |
| Claude Sonnet 4.6 | Official API | 3 | 15 | 0.3 | - | - |
| Claude Sonnet 4.6 | AWS Bedrock | 3 | 15 | 0.3 | - | - |
| Claude Sonnet 4.6 | Google Vertex AI | 3 | 15 | - | - | - |
| Claude Sonnet 4.6 | OpenRouter | 3 | 15 | - | - | - |
| Claude Haiku 4.5 | Official API | 1 | 5 | 0.1 | - | - |
| Claude Haiku 4.5 | AWS Bedrock | 1 | 5 | - | - | - |
| Claude Haiku 4.5 | Google Vertex AI | 1 | 5 | - | - | - |
| Claude Opus 4 | Official API | 15 | 75 | 1.875 | - | - |
| Claude Opus 4 | AWS Bedrock | 15 | 75 | 1.5 | - | - |
| Claude Opus 4 | Google Vertex AI | 15 | 75 | - | - | - |
| Claude Opus 4 | OpenRouter | 15 | 75 | - | - | - |
| Claude Sonnet 4 | Official API | 3 | 15 | 0.3 | - | - |
| Claude Sonnet 4 | AWS Bedrock | 3 | 15 | - | - | - |
| Claude Sonnet 4 | Google Vertex AI | 3 | 15 | - | - | - |
| Claude 3.5 Haiku | Official API | 0.8 | 4 | 0.08 | - | - |
| Claude 3.5 Haiku | AWS Bedrock | 0.8 | 4 | - | - | - |

### Baidu

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| ERNIE 4.5 300B | Baidu Qianfan | 0.28 | 0.9 | - | - | - |
| ERNIE 4.5 21B | Baidu Qianfan | 0.07 | 0.28 | - | - | - |
| ERNIE 4.5 21B Thinking | Baidu Qianfan | 0.07 | 0.28 | - | - | - |
| ERNIE 4.5 VL 424B | Baidu Qianfan | 0.42 | 1.25 | - | - | - |

### ByteDance

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Doubao-Pro | Volcano Engine | 0.11 | 0.42 | - | - | - |

### Cohere

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Command A | Official API | 2.5 | 10 | - | - | - |
| Command A | OpenRouter | 2.5 | 10 | - | - | - |
| Command A | SambaNova | 2.5 | 10 | - | - | - |
| Command A | DeepInfra | 2.5 | 7.5 | - | - | - |
| Command R7B | Official API | 0.04 | 0.15 | - | - | - |
| Command R7B | DeepInfra | 0.02 | 0.06 | - | - | - |
| Command R | Official API | 0.15 | 0.6 | - | - | - |
| Command R | OpenRouter | 0.15 | 0.6 | - | - | - |
| Command R | DeepInfra | 0.0625 | 0.25 | - | - | - |

### Deep Cogito

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Cogito v2.1 671B | Together AI | 1.25 | 1.25 | - | - | - |

### DeepSeek

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| DeepSeek-V4 Pro | DeepInfra | 1.74 | 3.48 | 0.145 | - | - |
| DeepSeek-V4 Pro | Together AI | 2.1 | 4.4 | 0.2 | - | - |
| DeepSeek-V4 Flash | DeepInfra | 0.14 | 0.28 | 0.028 | - | - |
| DeepSeek-V3.1 Terminus | DeepInfra | 0.27 | 0.95 | 0.13 | - | - |
| DeepSeek-V4 | Official API | 0.3 | 0.5 | 0.03 | - | - |
| DeepSeek-V4 | DeepInfra | 0.27 | 0.42 | 0.14 | - | - |
| DeepSeek-V4 | SiliconFlow | 0.28 | 0.48 | - | - | - |
| DeepSeek-V4 | OpenRouter | 0.3 | 0.5 | - | - | - |
| DeepSeek-V4 | Volcano Engine | 0.31 | 0.52 | - | - | - |
| DeepSeek-V4 | Baidu Qianfan | 0.33 | 0.55 | - | - | - |
| DeepSeek-V3.1 | DeepInfra | 0.21 | 0.79 | 0.13 | - | - |
| DeepSeek-V3.1 | Together AI | 0.6 | 1.7 | - | - | - |
| DeepSeek-V3.1 | OpenRouter | 0.4 | 0.8 | - | - | - |
| DeepSeek-R1-0528 Turbo | DeepInfra | 1 | 3 | - | - | - |
| DeepSeek-V3.2 | Official API | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-V3.2 | DeepInfra | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-V3.2 | SiliconFlow | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3.2 | OpenRouter | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | Official API | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-V3 | Together AI | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | Fireworks AI | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | SiliconFlow | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | OpenRouter | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | DeepInfra | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-V3 | SambaNova | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | Volcano Engine | 0.14 | 0.28 | - | - | - |
| DeepSeek-V3 | Baidu Qianfan | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | Official API | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-R1 | Together AI | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | Fireworks AI | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | SiliconFlow | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | OpenRouter | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | DeepInfra | 0.14 | 0.28 | 0.0028 | - | - |
| DeepSeek-R1 | SambaNova | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | Nebius AI | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | Volcano Engine | 0.14 | 0.28 | - | - | - |
| DeepSeek-R1 | Baidu Qianfan | 0.14 | 0.28 | - | - | - |

### Essential AI

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Rnj-1 Instruct | Together AI | 0.15 | 0.15 | - | - | - |

### Google

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Gemma 4 31B | DeepInfra | 0.12 | 0.37 | - | - | - |
| Gemma 4 31B | Together AI | 0.2 | 0.5 | - | - | - |
| Gemma 4 26B A4B | DeepInfra | 0.07 | 0.34 | - | - | - |
| Gemini 3.1 Flash Lite | Official API | 0.25 | 1.5 | 0.025 | - | - |
| Gemini 3.1 Flash Lite | Google Vertex AI | 0.25 | 1.5 | - | - | - |
| Gemini 3.1 Flash Lite | OpenRouter | 0.25 | 1.5 | - | - | - |
| Gemini 3.1 Pro | Official API | 2 | 12 | 0.2 | - | - |
| Gemini 3.1 Pro | Google Vertex AI | 2 | 12 | - | - | - |
| Gemini 3.1 Pro | OpenRouter | 2 | 12 | - | - | - |
| Gemini 3 Flash | Official API | 0.5 | 3 | 0.05 | - | - |
| Gemini 3 Flash | Google Vertex AI | 0.5 | 3 | - | - | - |
| Gemini 3 Flash | OpenRouter | 0.5 | 3 | - | - | - |
| Gemini 2.5 Flash | Official API | 0.5 | 3 | - | - | - |
| Gemini 2.5 Flash | Google Vertex AI | 0.5 | 3 | - | - | - |
| Gemini 2.5 Pro | Official API | 2 | 12 | - | - | - |
| Gemini 2.5 Pro | Google Vertex AI | 2 | 12 | - | - | - |
| Gemini 2.5 Pro | OpenRouter | 2 | 12 | - | - | - |
| Gemini 2.0 Flash | Official API | 0.5 | 3 | - | - | - |
| Gemini 2.0 Flash | Google Vertex AI | 0.5 | 3 | - | - | - |

### Liquid AI

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| LFM2 24B A2B | Together AI | 0.03 | 0.12 | - | - | - |

### Meta

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Llama 4 Maverick | Together AI | 0.27 | 0.85 | - | - | - |
| Llama 4 Maverick | Fireworks AI | 0.15 | 0.6 | - | - | - |
| Llama 4 Maverick | Groq | 0.2 | 0.6 | - | - | - |
| Llama 4 Maverick | OpenRouter | 0.22 | 0.85 | - | - | - |
| Llama 4 Maverick | AWS Bedrock | 0.27 | 0.36 | - | - | - |
| Llama 4 Maverick | SambaNova | 0.63 | 1.8 | - | - | - |
| Llama 4 Maverick | DeepInfra | 0.2 | 0.6 | - | - | - |
| Llama 4 Scout | Together AI | 0.12 | 0.38 | - | - | - |
| Llama 4 Scout | Fireworks AI | 0.1 | 0.3 | - | - | - |
| Llama 4 Scout | Groq | 0.11 | 0.34 | - | - | - |
| Llama 4 Scout | OpenRouter | 0.12 | 0.34 | - | - | - |
| Llama 4 Scout | SiliconFlow | 0.1 | 0.3 | - | - | - |
| Llama 4 Scout | DeepInfra | 0.08 | 0.24 | - | - | - |
| Llama 3.3 70B | Together AI | 0.59 | 0.79 | - | - | - |
| Llama 3.3 70B | Fireworks AI | 0.9 | 0.9 | - | - | - |
| Llama 3.3 70B | Groq | 0.59 | 0.79 | - | - | - |
| Llama 3.3 70B | OpenRouter | 0.1 | 0.32 | - | - | - |
| Llama 3.3 70B | AWS Bedrock | 2.65 | 3.5 | - | - | - |
| Llama 3.3 70B | SiliconFlow | 0.59 | 0.79 | - | - | - |
| Llama 3.3 70B | DeepInfra | 0.6 | 0.6 | - | - | - |
| Llama 3.3 70B | SambaNova | 0.6 | 1.2 | - | - | - |
| Llama 3.3 70B | Nebius AI | 0.13 | 0.4 | - | - | - |
| Llama 3.1 8B | Together AI | 0.05 | 0.08 | - | - | - |
| Llama 3.1 8B | Fireworks AI | 0.03 | 0.05 | - | - | - |
| Llama 3.1 8B | Groq | 0.05 | 0.08 | - | - | - |
| Llama 3.1 8B | OpenRouter | 0.04 | 0.08 | - | - | - |
| Llama 3.1 8B | SiliconFlow | 0.04 | 0.08 | - | - | - |
| Llama 3.1 8B | DeepInfra | 0.02 | 0.03 | - | - | - |
| Llama 3.1 8B | Cerebras | 0.1 | 0.1 | - | - | - |
| Llama 3.1 8B | SambaNova | 0.1 | 0.2 | - | - | - |
| Llama 3.1 8B | Nebius AI | 0.03 | 0.09 | - | - | - |

### MiniMax

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| MiniMax M2.7 | Together AI | 0.3 | 1.2 | 0.06 | - | - |
| MiniMax M2.7 | OpenRouter | 0.279 | 1.2 | - | - | - |
| MiniMax M2.5 | Together AI | 0.3 | 1.2 | 0.06 | - | - |
| MiniMax M2.5 | OpenRouter | 0.15 | 1.15 | - | - | - |

### Mistral

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Mistral Large 3 | Official API | 0.5 | 1.5 | 0.05 | - | - |
| Mistral Large 3 | OpenRouter | 0.5 | 1.5 | - | - | - |
| Mistral Large 3 | DeepInfra | 0.5 | 1.5 | 0.07 | - | - |
| Mistral Large 3 | Together AI | 0.5 | 1.5 | - | - | - |
| Mistral Large 3 | Fireworks AI | 0.5 | 1.5 | - | - | - |
| Mistral Large 3 | Azure OpenAI | 0.5 | 1.5 | - | - | - |
| Mistral Large 3 | AWS Bedrock | 0.5 | 1.5 | - | - | - |
| Mistral Medium 3 | Official API | 1.5 | 7.5 | 0.1 | - | - |
| Mistral Medium 3 | OpenRouter | 1.5 | 7.5 | - | - | - |
| Mistral Medium 3 | DeepInfra | 1.5 | 7.5 | - | - | - |
| Mistral Small 3.2 24B | DeepInfra | 0.075 | 0.2 | - | - | - |
| Mistral Small 3.2 24B | Official API | 0.1 | 0.3 | - | - | - |
| Mistral Small 3.1 | Official API | 0.15 | 0.6 | 0.02 | - | - |
| Mistral Small 3.1 | OpenRouter | 0.15 | 0.6 | - | - | - |
| Mistral Small 3.1 | DeepInfra | 0.15 | 0.6 | - | - | - |
| Mistral Small 3.1 | Together AI | 0.15 | 0.6 | - | - | - |
| Mistral Small 3.1 | Fireworks AI | 0.15 | 0.6 | - | - | - |

### Moonshot AI

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Kimi K2.6 | Together AI | 1.2 | 4.5 | 0.2 | - | - |
| Kimi K2.6 | OpenRouter | 1.5 | 5 | - | - | - |
| Kimi K2.5 | Kimi | 0.383 | 1.72 | 0.07 | - | - |
| Kimi K2.5 | Volcano Engine | 0.42 | 1.8 | - | - | - |
| Kimi K2 Thinking | Kimi | 0.47 | 2 | 0.141 | - | - |
| Kimi K2 Thinking | Volcano Engine | 0.5 | 2.2 | - | - | - |

### NVIDIA

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| Nemotron-3 Nano Omni 30B A3B Reasoning | DeepInfra | 0.2 | 0.8 | - | - | - |
| Nemotron-3 Super 120B A12B | DeepInfra | 0.1 | 0.5 | - | - | - |
| Nemotron-3 Super 120B A12B | Together AI | 0.1 | 0.5 | - | - | - |
| Nemotron-3 Nano 30B A3B | DeepInfra | 0.05 | 0.2 | - | - | - |
| Llama 3.3 Nemotron Super 49B | DeepInfra | 0.1 | 0.4 | - | - | - |

### OpenAI

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| GPT-5.5 | Official API | 5 | 30 | 0.5 | 2.5 | 15 |
| GPT-5.5 | Azure OpenAI | 5 | 30 | 0.5 | - | - |
| GPT-5.5 | OpenRouter | 5 | 30 | 0.5 | - | - |
| GPT-5.4 Pro | Official API | 30 | 180 | 3 | - | - |
| GPT-5.4 Pro | Azure OpenAI | 30 | 180 | - | - | - |
| gpt-oss-120B | Together AI | 0.15 | 0.6 | - | - | - |
| gpt-oss-120B | DeepInfra | 0.14 | 0.55 | - | - | - |
| gpt-oss-20B | Together AI | 0.05 | 0.2 | - | - | - |
| gpt-oss-20B | DeepInfra | 0.05 | 0.2 | - | - | - |
| GPT-5.4 | Official API | 2.5 | 15 | 0.25 | - | - |
| GPT-5.4 | Azure OpenAI | 2.5 | 15 | 0.25 | - | - |
| GPT-5.4 | OpenRouter | 2.5 | 15 | 0.25 | - | - |
| GPT-5.4 mini | Official API | 0.75 | 4.5 | 0.075 | - | - |
| GPT-5.4 mini | Azure OpenAI | 0.75 | 4.5 | - | - | - |
| GPT-5.4 mini | OpenRouter | 0.75 | 4.5 | - | - | - |
| GPT-5.4 nano | Official API | 0.2 | 1.25 | 0.02 | - | - |
| GPT-5.4 nano | Azure OpenAI | 0.2 | 1.25 | - | - | - |
| GPT-5.2 | Official API | 1.75 | 14 | 0.175 | - | - |
| GPT-5.2 | Azure OpenAI | 1.75 | 14 | - | - | - |
| GPT-5.2 | OpenRouter | 1.75 | 14 | - | - | - |
| GPT-5 nano | Official API | 0.025 | 0.2 | 0.0025 | - | - |
| GPT-5 nano | Azure OpenAI | 0.025 | 0.2 | - | - | - |
| GPT-5 mini | Official API | 0.45 | 3.6 | 0.045 | - | - |
| GPT-5 mini | OpenRouter | 0.45 | 3.6 | - | - | - |
| o3 | Official API | 1 | 4 | 0.25 | - | - |
| o3 | Azure OpenAI | 1 | 4 | - | - | - |
| o4-mini | Official API | 0.55 | 2.2 | 0.1375 | - | - |
| o4-mini | Azure OpenAI | 0.55 | 2.2 | - | - | - |
| GPT-4.1 | Official API | 2 | 8 | 0.5 | 1 | 4 |
| GPT-4.1 | Azure OpenAI | 2 | 8 | 0.5 | - | - |
| GPT-4.1 | OpenRouter | 2 | 8 | 0.5 | - | - |
| GPT-4.1 mini | Official API | 0.4 | 1.6 | 0.1 | - | - |
| GPT-4.1 mini | Azure OpenAI | 0.4 | 1.6 | - | - | - |
| GPT-4.1 mini | OpenRouter | 0.4 | 1.6 | - | - | - |
| GPT-4.1 nano | Official API | 0.1 | 0.4 | 0.025 | - | - |
| GPT-4.1 nano | Azure OpenAI | 0.1 | 0.4 | - | - | - |
| GPT-4o | Official API | 2.5 | 10 | 1.25 | - | - |
| GPT-4o | Azure OpenAI | 2.5 | 10 | 1.25 | - | - |
| GPT-4o | AWS Bedrock | 2.5 | 10 | - | - | - |
| GPT-4o | OpenRouter | 2.5 | 10 | - | - | - |

### Zhipu AI

| Model | Vendor | Input ($/M) | Output ($/M) | Cached Input | Batch In | Batch Out |
|-------|--------|------------|--------------|--------------|----------|-----------|
| GLM-5.1 | Zhipu (GLM) | 1.4 | 4.4 | 0.28 | - | - |
| GLM-5.1 | OpenRouter | 1.4 | 4.4 | - | - | - |
| GLM-5 Turbo | Zhipu (GLM) | 1.2 | 4 | 0.24 | - | - |
| GLM-5 Turbo | OpenRouter | 1.2 | 4 | - | - | - |
| GLM-5 | Zhipu (GLM) | 1 | 3.2 | 0.2 | - | - |
| GLM-5 | OpenRouter | 1 | 3.2 | 0.2 | - | - |
| GLM-5 | Volcano Engine | 0.56 | 2.5 | - | - | - |
| GLM-4.7-FlashX | Zhipu (GLM) | 0.07 | 0.4 | - | - | - |
| GLM-4.7-FlashX | OpenRouter | 0.06 | 0.4 | - | - | - |
| GLM-4.7 | Zhipu (GLM) | 0.6 | 2.2 | 0.11 | - | - |
| GLM-4.7 | OpenRouter | 0.39 | 1.75 | - | - | - |
| GLM-4.7 | Volcano Engine | 0.42 | 1.53 | - | - | - |
| GLM-4-Plus | Zhipu (GLM) | 0.69 | 0.69 | - | - | - |
| GLM-4-Plus | Volcano Engine | 0.56 | 0.56 | - | - | - |
| GLM-4-FlashX | Zhipu (GLM) | 0.014 | 0.014 | - | - | - |
| GLM-Z1-Air | Zhipu (GLM) | 0.069 | 0.069 | - | - | - |
| GLM-Z1-Air | Volcano Engine | 0.083 | 0.083 | - | - | - |

---

## SECTION 2: IMAGE MODEL PRICING

### GPT Image 1.5 (OpenAI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.009 | Low |
| Official API | $0.034 | Medium |
| Official API | $0.133 | High |

### GPT Image 1 Mini (OpenAI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.005 | Low |
| Official API | $0.015 | Medium |
| Official API | $0.036 | High |

### DALL-E 3 (OpenAI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.04 | Standard |
| Official API | $0.08 | HD |
| Azure OpenAI | $0.04 | Standard |
| Azure OpenAI | $0.08 | HD |

### Stable Diffusion 3.5 Large (Stability AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.065 | Standard |
| fal.ai | $0.025 | Standard |
| Replicate | $0.035 | Standard |

### SD 3.5 Large Turbo (Stability AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.04 | Standard |
| fal.ai | $0.015 | Standard |
| Replicate | $0.02 | Standard |

### Stable Image Ultra (Stability AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.08 | Ultra |

### SDXL 1.0 (Stability AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.009 | Standard |
| fal.ai | $0.004 | Standard |
| Replicate | $0.005 | Standard |

### FLUX.2 Pro (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.03 | 1MP |
| fal.ai | $0.015 | 1MP |
| Replicate | $0.03 | 1MP |
| Together AI | $0.025 | 1MP |

### FLUX.2 Max (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.07 | 1MP |
| fal.ai | $0.04 | 1MP |

### FLUX.2 Klein 4B (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.014 | 1MP |
| fal.ai | $0.008 | 1MP |

### Imagen 4 (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Google Vertex AI | $0.04 | Standard |

### Imagen 4 Ultra (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Google Vertex AI | $0.06 | Ultra |

### Imagen 4 Fast (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Google Vertex AI | $0.02 | Fast |

### Grok Imagine (xAI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.02 | Standard |
| Official API | $0.07 | Pro |

### Midjourney V7 (Midjourney)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.12 | Basic ($10/mo) |
| Official API | $0.06 | Standard ($30/mo, Relax) |

### FLUX.2 dev (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| DeepInfra | $0.009 | 1024x1024 @ 28 steps |
| Together AI | $0.0154 | Standard |

### FLUX.2 flex (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.03 | Standard |

### FLUX.2 Klein 9B (Black Forest Labs)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| DeepInfra | $0.015 | 1024x1024 |

### Wan 2.6 Image (Alibaba)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.03 | Standard |

### Nano Banana 2 (Gemini 3.1 Flash Image) (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Official API | $0.05 | 1024x1024 |
| Together AI | $0.05 | Standard |

### Nano Banana Pro (Gemini 3 Pro Image) (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.134 | Standard |

### Imagen 4.0 Fast (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Google Vertex AI | $0.02 | 1024x1024 |
| Together AI | $0.02 | Standard |

### Imagen 4.0 Ultra (Google)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Google Vertex AI | $0.06 | 1024x1024 |
| Together AI | $0.06 | Standard |

### Qwen Image 2.0 Pro (Alibaba)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.08 | Standard |

### Qwen Image 2.0 (Alibaba)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.04 | Standard |

### Ideogram 3.0 (Ideogram)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.06 | Standard |

### Seedream 3.0 (ByteDance)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.018 | Standard |

### Seedream 4.0 (ByteDance)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.03 | Standard |

### HiDream-I1 Full (HiDream AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.009 | Full |

### HiDream-I1 Dev (HiDream AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.0045 | Dev |

### HiDream-I1 Fast (HiDream AI)

| Vendor | Price/Image | Label |
|--------|------------|-------|
| Together AI | $0.0032 | Fast |

---

## SECTION 3: VIDEO MODEL PRICING

### Sora 2 (OpenAI)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Official API | $0.1 | 720p |
| Official API | $0.3 | Pro 720p |
| Official API | $0.5 | Pro 1080p+ |
| fal.ai | $0.1 | 720p |

### Veo 3 (Google)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Google Vertex AI | $0.15 | Veo 3 Fast 720p |
| Google Vertex AI | $0.4 | Veo 3 1080p |
| fal.ai | $0.15 | Veo 3 Fast |

### Kling 3.0 (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| fal.ai | $0.029 | Standard 720p |
| fal.ai | $0.07 | Pro 1080p |
| Replicate | $0.032 | Standard 720p |

### Runway Gen-4 (Runway)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Official API | $0.05 | Gen-4 Turbo |
| Official API | $0.12 | Gen-4 Aleph |

### Pika 2.5 (Pika)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| fal.ai | $0.04 | 720p 5s |
| fal.ai | $0.09 | 1080p 5s |

### Luma Ray 2 (Luma AI)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Official API | $0.1 | 720p |
| fal.ai | $0.142 | 720p |

### Seedance 2.0 (ByteDance)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Official API | $0.002 | 720p |
| Volcano Engine | $0.14 | Pure Generation |
| Volcano Engine | $0.095 | Video Editing |
| fal.ai | $0.014 | 720p |
| fal.ai | $0.005 | 480p |

### Seedance 1.5 Pro (ByteDance)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| fal.ai | $0.052 | 720p w/audio |
| fal.ai | $0.026 | 720p no audio |

### Seedance 1.0 Pro (ByteDance)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| fal.ai | $0.124 | 1080p |
| fal.ai | $0.055 | 720p |

### Kling 2.1 Master (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.092 | 1080p |

### Kling 2.1 Pro (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.064 | 1080p |

### Kling 2.1 Standard (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.036 | 1080p |

### Kling 2.0 Master (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.092 | 1080p |

### Kling 1.6 Pro (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.064 | 1080p |

### Kling 1.6 Standard (Kuaishou (Kling))

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.038 | 1080p |

### Veo 3.0 Fast (Google)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Google Vertex AI | $0.15 | Veo 3 Fast 720p |
| Google Vertex AI | $0.3 | Veo 3 Fast 1080p |
| Together AI | $0.1 | 720p |

### Veo 3.0 Fast + Audio (Google)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.15 | 1080p+Audio |

### Veo 3.0 + Audio (Google)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.4 | 1080p+Audio |

### Wan 2.2 I2V (Alibaba)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.062 | 1080p |

### Wan 2.2 T2V (Alibaba)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.132 | 1080p |

### MiniMax Hailuo 02 (MiniMax)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.082 | 1080p |

### MiniMax 01 Director (MiniMax)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.047 | 720p |

### Vidu 2.0 (Vidu)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.035 | 1080p |

### Vidu Q1 (Vidu)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.028 | 1080p |

### PixVerse v5 (PixVerse)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.075 | 1080p |

### Seedance 1.0 Lite (ByteDance)

| Vendor | Price/Second | Label |
|--------|-------------|-------|
| Together AI | $0.035 | 720p |

---

## SECTION 4: FREE MODELS & OPEN WEIGHTS

### ChatAnywhere Free API (chatanywhere)

- Context: 128,000 tokens
- Free platforms: GitHub, apply for key

### Free LLM API Resources (cheahjs)

- Context: 128,000 tokens
- Free platforms: GitHub

### AIClient 2 API (justlovemaki)

- Context: 128,000 tokens
- Free platforms: GitHub

### Free ChatGPT API (popjane)

- Context: 128,000 tokens
- Free platforms: GitHub

### Llama 3.1 70B (Meta)

- Context: 128,000 tokens
- Free platforms: HuggingFace, Ollama

### Qwen 2.5 72B (Alibaba)

- Context: 32,768 tokens
- Free platforms: HuggingFace, Ollama

### DeepSeek Chat (DeepSeek)

- Context: 64,000 tokens
- Free platforms: HuggingFace, Ollama

### Phi-4 (Microsoft)

- Context: 16,000 tokens
- Free platforms: HuggingFace, Ollama

### Mistral Large 2 (Mistral)

- Context: 128,000 tokens
- Free platforms: HuggingFace

### Gemma 2 27B (Google)

- Context: 8,192 tokens
- Free platforms: HuggingFace, Ollama

### Command R+ (Cohere)

- Context: 128,000 tokens
- Free platforms: HuggingFace

### CodeLlama 70B (Meta)

- Context: 100,000 tokens
- Free platforms: HuggingFace, Ollama

### Stable LM 2 12B (Stability AI)

- Context: 4,096 tokens
- Free platforms: HuggingFace

### Aya Expanse 8B (Cohere)

- Context: 8,192 tokens
- Free platforms: HuggingFace

### Llama 3.2 1B (Meta)

- Context: 128,000 tokens
- Free platforms: HuggingFace, Ollama

### SmolLM 1.7B (HuggingFace)

- Context: 4,096 tokens
- Free platforms: HuggingFace, Ollama

---

## SECTION 5: WEB SERVICE SUBSCRIPTIONS

### ChatGPT

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Go | $8 | - | No |
| Plus | $20 | - | No |
| Pro | $200 | - | No |

### Claude

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $20 | $17/mo | No |
| Max 5x | $100 | - | No |
| Max 20x | $200 | - | No |

### Gemini

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Advanced | $19.99 | - | No |
| Ultra | $249.99 | - | No |

### Perplexity

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $20 | $17/mo | No |

### Grok

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Premium | $8 | - | No |
| Premium+ | $30 | - | No |

### GitHub Copilot

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $10 | $8.33/mo | No |
| Business | $19 | - | No |
| Enterprise | $39 | - | No |

### Cursor

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $20 | $16/mo | No |
| Ultra | $200 | - | No |
| Business | $40 | - | No |

### Windsurf

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $20 | - | No |
| Max | $200 | - | No |

### Midjourney

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Basic | $10 | $8/mo | No |
| Standard | $30 | $24/mo | No |
| Pro | $60 | $48/mo | No |
| Mega | $120 | $96/mo | No |

### Runway

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Basic | $12 | - | No |
| Standard | $28 | - | No |
| Pro | $76 | - | No |
| Unlimited | $188 | - | No |

### Pika

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Standard | $8 | $8/mo | No |
| Pro | $28 | - | No |
| Fancy | $76 | - | No |

### Kling AI

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Standard | $10 | - | No |
| Pro | $35 | - | No |
| Premier | $49 | - | No |

### Dreamina (Jimeng)

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $10 | - | No |

### HeyGen

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Creator | $29 | - | No |
| Business | $89 | - | No |

### v0 by Vercel

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Premium | $20 | - | No |

### Mistral

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $14.99 | - | No |
| Team | $24.99 | - | No |

### HuggingFace

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $9 | - | No |
| Enterprise | $0 | - | No |

### Poe

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $19.99 | - | No |

### Upstage

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $0 | - | No |

### NVIDIA NIM

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Self-Hosted | $0 | - | No |

### Xiaomi AI

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pay-as-you-go | $0 | - | No |

### NanoGPT

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Pay-as-you-go | $0 | - | No |

### Ollama Cloud

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Cloud | $0 | - | No |

### Vercel AI Gateway

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $20 | - | No |
| Enterprise | $0 | - | No |

### v0

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Premium | $20 | - | No |

### GitHub Models

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Production | $0 | - | No |

### GitLab Duo

| Tier | Monthly Price | Annual Price | Free? |
|------|--------------|-------------|-------|
| Free | $0 | - | Yes |
| Pro | $19 | - | No |
| Enterprise | $39 | - | No |

---

## FAQ

### Q: Which LLM API is the cheapest?

**A:** The five cheapest models by official API input price are GPT-5 nano ($0.025/M), Command R7B ($0.04/M), GPT-4.1 nano ($0.1/M), Mistral Small 3.2 24B ($0.1/M), and QwQ-32B ($0.12/M).

---

BestToken: https://www.uwa4d.cn

Data source: Official vendor pricing pages

Last verified: 2026-05-15
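
The FAQ ranking above looks at input price only. As a rough illustration of how the Section 1 rates combine into a per-request cost, the sketch below multiplies token counts by the per-million-token input and output prices and re-ranks the same five Official API rows. The `Price` class, the helper names, and the 2,000-in / 500-out workload are hypothetical choices made for this example; cached-input and batch discounts are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD, copied from the Section 1 tables."""
    model: str
    vendor: str
    input_per_m: float
    output_per_m: float


# The five Official API rows named in the FAQ (not the full table).
PRICES = [
    Price("GPT-5 nano", "Official API", 0.025, 0.2),
    Price("Command R7B", "Official API", 0.04, 0.15),
    Price("GPT-4.1 nano", "Official API", 0.1, 0.4),
    Price("Mistral Small 3.2 24B", "Official API", 0.1, 0.3),
    Price("QwQ-32B", "Official API", 0.12, 0.36),
]


def request_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request, ignoring cached-input and batch discounts."""
    total = input_tokens * price.input_per_m + output_tokens * price.output_per_m
    return total / 1_000_000


def rank_by_cost(prices, input_tokens: int, output_tokens: int):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(p.model, request_cost(p, input_tokens, output_tokens)) for p in prices]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
    for model, cost in rank_by_cost(PRICES, 2_000, 500):
        print(f"{model}: ${cost:.6f}")
```

Under this assumed workload, Mistral Small 3.2 24B ranks ahead of GPT-4.1 nano even though both list $0.1/M input, because its output price is lower. An input-only ranking can therefore differ from the ranking for a real traffic mix.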