Large Model API Pricing
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.
Anthropic的Claude模型提供先进的安全AI能力,专注于有用、无害、诚实的AI助手体验,并具备强大的推理和对话能力。
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| claude-haiku-4-5-20251001 | 1–2,000 | 20,000 | $5 | $13 (5m) | $12 | $10 | Go check it out |
| 2,000–10,000 | 20,000 | $3 | $8 (5m) | $8 | $4 | Go check it out | |
| claude-sonnet-4-5-20250929 | 1–2,000 | 200,000 | $2 | $4(5m) × $8(1h)$ | $4 | $2 | Go check it out |
| 2,000–20,000 | 200,000 | $4 | $6(5 min) × $10(1 hr)$ | $6 | $4 | Go check it out | |
| claude-3-7-sonnet-20250219 | - | 200,000 | $3 | $3.75 (5 m) | $0.3 | $15 | Go check it out |
| claude-sonnet-4-20250514 | - | 200,000 | $3 | $3.75 (5 min) · $6.60 (1 hr) | $0.3 | $15 | Go check it out |
| claude-opus-4-20250514 | - | 200,000 | $15 | $18.75 (5 m) | $1.5 | $75 | Go check it out |
| claude-opus-4-1-20250805 | - | 200,000 | $15 | $18.75 (5 m) | $1.5 | $75 | Go check it out |
| claude-3-5-sonnet-20241022 | - | 200,000 | $3 | $3.75 (5 m) | $0.3 | $15 | Go check it out |
| claude-3-haiku-20240307 | - | 200,000 | $0.25 | - | - | $1.25 | Go check it out |
| claude-3-5-haiku-20241022 | - | 200,000 | $0.8 | - | - | $4 | Go check it out |
OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.
| Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|---|
| gpt-5-codex | 400,000 | $1.25 | - | $0.125 | $10 | Go check it out |
| OpenAI GPT OSS 120B | 131,072 | $0.1 | - | - | $0.5 | Go check it out |
| OpenAI: GPT OSS 20B | 131,072 | $0.05 | - | - | $0.2 | Go check it out |
| gpt-5 | 400,000 | $1.25 | $0.1(5m) \times $0.2(1h) | $0.125 | $10 | Go check it out |
| gpt-5-mini | 400,000 | $0.25 | - | $0.025 | $2 | Go check it out |
| gpt-5-nano | 400,000 | $0.05 | - | $0.005 | $0.4 | Go check it out |
| gpt-5-pro | 400,000 | $15 | $1 (1 hour) | - | $120 | Go check it out |
| gpt-5-chat-latest | 400,000 | $1.25 | - | $0.125 | $10 | Go check it out |
| gpt-4.1-mini | 1,047,576 | $0.4 | - | $0.1 | $1.6 | Go check it out |
| gpt-4.1-nano | 1,047,576 | $0.1 | - | $0.025 | $0.4 | Go check it out |
| gpt-4.1 | 1,047,576 | $2 | - | $0.5 | $8 | Go check it out |
| gpt-4o-mini | 131,072 | $0.15 | - | $0.075 | $0.6 | Go check it out |
| gpt-4o | 131,072 | $2.5 | - | $1.25 | $10 | Go check it out |
Google的Gemini模型提供高质量的语言处理能力,在各种NLP任务中表现出色,并具备强大的多模态能力。
| Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|---|---|
| Gemma3 12B | 131,072 | $0.05 | - | - | $0.1 | Go check it out |
| gemini-2.5-flash | 1,048,576 | $0.3 | $0.083 (5m) | $0.075 | $2.5 | Go check it out |
| gemini-2.5-pro | 1,048,576 | $1.25 | $0.375 (5m) | $0.3125 | $10 | Go check it out |
| Gemma 3 27B | 32,768 | $0.119 | - | - | $0.2 | Go check it out |
| gemini-3.1-flash-lite-preview | 1,000,000 | $1 | $2(5m) \times $2(1h) | $1 | $2 | Go check it out |
| gemini-2.5-flash-lite-preview-09-2025 | 1,048,576 | $0.1 | $0.083 (5m) | $0.01 | $0.4 | Go check it out |
| gemini-2.0-flash-lite | 1,048,576 | $0.075 | $0.083 (5m) | $0.0188 | $0.3 | Go check it out |
| gemini-2.5-flash-lite | 1,048,576 | $0.1 | $0.083 (5m) | $0.025 | $0.4 | Go check it out |
| gemini-2.5-flash-lite-preview-06-17 | 1,048,576 | $0.1 | - | - | $0.4 | Go check it out |
| gemini-2.5-flash-preview-05-20 | 1,048,576 | $0.15 | - | - | $3.5 | Go check it out |
| gemini-2.5-pro-preview-06-05 | 1,048,576 | $1.25 | - | - | $10 | Go check it out |
| gemini-2.0-flash-20250609 | 1,048,576 | $0.15 | - | - | $0.6 | Go check it out |
Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Llama 3.1 8B Instruct | 16,384 | $0.02 | $0.05 | Go check it out |
| Llama 3.3 70B Instruct | 131,072 | $0.13 | $0.39 | Go check it out |
| Llama 4 Maverick Instruct | 1,048,576 | $0.17 | $0.85 | Go check it out |
| Llama 4 Scout Instruct | 131,072 | $0.1 | $0.5 | Go check it out |
| Llama 3.2 3B Instruct | 32,768 | $0.03 | $0.05 | Go check it out |
Qwen系列模型提供高效的语言处理能力,具有多种参数规模,涵盖从轻量级到企业级的解决方案。
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Qwen/Qwen3-8B | - | Free | Free | Go check it out |
| Qwen3 Next 80B A3B Thinking | 65,536 | $0.15 | $1.5 | Go check it out |
| Qwen3 Coder 480B A35B Instruct | 262,144 | $0.29 | $1.2 | Go check it out |
| Qwen3 235B A22b Thinking 2507 | 131,072 | $0.3 | $3 | Go check it out |
| Qwen3 235B A22B Instruct 2507 | 131,072 | $0.15 | $0.8 | Go check it out |
| Qwen 2.5 72B Instruct | 32,000 | $0.38 | $0.4 | Go check it out |
| Qwen3 235B A22B | 40,960 | $0.2 | $0.8 | Go check it out |
| Qwen2.5 VL 72B Instruct | 32,768 | $0.8 | $0.8 | Go check it out |
| Qwen3 32B | 40,960 | $0.1 | $0.45 | Go check it out |
| Qwen3 30B A3B | 40,960 | $0.09 | $0.45 | Go check it out |
| Qwen3 Next 80B A3B Instruct | 65,536 | $0.15 | $1.5 | Go check it out |
| Qwen MT Plus | 4,096 | $0.25 | $0.75 | Go check it out |
| Qwen3 8B | 128,000 | $0.035 | $0.138 | Go check it out |
| Qwen2.5 7B Instruct | 32,000 | $0.07 | $0.07 | Go check it out |
百度的ERNIE模型提供先进的中文语言理解和多模态能力,针对中文应用进行了优化,并具备具有竞争力的价格。
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| ERNIE 4.5 VL 424B A47B | 123,000 | $0.42 | $1.25 | Go check it out |
| ERNIE 4.5 300B A47B | 123,000 | $0.28 | $1.1 | Go check it out |
来自清华大学的GLM系列模型,具备先进的中文语言理解和生成能力。
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| GLM-4.5 | 131,072 | $0.6 | $2.2 | Go check it out |
| GLM 4.5V | 65,536 | $0.6 | $1.8 | Go check it out |
| GLM 4.1V 9B Thinking | 65,536 | $0.035 | $0.138 | Go check it out |
A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| L3 70B Euryale V2.1 | 8,192 | $1.48 | $1.48 | Go check it out |
| Sao10k L3 8B Lunaris | 8,192 | $0.05 | $0.05 | Go check it out |
| L3 8B Stheno V3.2 | 8,192 | $0.05 | $0.05 | Go check it out |
| L31 70B Euryale V2.2 | 8,192 | $1.48 | $1.48 | Go check it out |
A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mistral Nemo | 60,288 | $0.04 | $0.17 | Go check it out |
| Mistral 7B Instruct | 32,768 | $0.029 | $0.059 | Go check it out |
Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| deepseek/deepseek-v3.1-test | - | 20,000 | Free | - | - | Free | Go check it out |
| DeepSeek V3.1 | - | 163,840 | $20 | $1 (5m) | $1 | $100 | Go check it out |
| DeepSeek R1 0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6 | Go check it out |
| 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6 | Go check it out | |
| 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4 | Go check it out | |
| DeepSeek V3 0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14 | Go check it out |
MiniMax AI的先进语言模型提供强大的对话AI能力,在客户服务、内容生成和创意应用中表现优异,并具备强大的多语言支持和企业级可扩展性。
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| MiniMax M1 | 1,000,000 | $0.55 | $2.2 | Go check it out |
An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.
| Model Name | Context | Input (/Mt) | Output (/Mt) | Operation |
|---|---|---|---|---|
| Mythomax L2 13B | 4,096 | $0.09 | $0.09 | Go check it out |
最先进AI模型的高级集合,具备高级推理、数学证明能力以及跨多个领域的前沿语言理解能力。
| Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache read (/Mt) | Output (/Mt) | Actions |
|---|---|---|---|---|---|---|---|
| DeepSeek V3.1 | - | 163,840 | $20 | $1 (5m) | $1 | $100 | Go check it out |
| OpenAI GPT OSS 120B | - | 131,072 | $0.1 | - | - | $0.5 | Go check it out |
| GLM-4.5 | - | 131,072 | $0.6 | - | - | $2.2 | Go check it out |
| Qwen3 235B A22b Thinking 2507 | - | 131,072 | $0.3 | - | - | $3 | Go check it out |
| GLM 4.5V | - | 65,536 | $0.6 | - | - | $1.8 | Go check it out |
| OpenAI: GPT OSS 20B | - | 131,072 | $0.05 | - | - | $0.2 | Go check it out |
| MiniMax M1 | - | 1,000,000 | $0.55 | - | - | $2.2 | Go check it out |
| DeepSeek R1 0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6 | Go check it out |
| 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6 | Go check it out | |
| 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4 | Go check it out | |
| Qwen3 235B A22B | - | 40,960 | $0.2 | - | - | $0.8 | Go check it out |
| Llama 4 Maverick Instruct | - | 1,048,576 | $0.17 | - | - | $0.85 | Go check it out |
| Llama 4 Scout Instruct | - | 131,072 | $0.1 | - | - | $0.5 | Go check it out |
| 222 | 1–200 | 222 | $2 | $5 (5m) | $4 | $3 | Go check it out |
| 200–50,000 | 222 | $3 | $6 (5m) | $5 | $4 | Go check it out | |
| 50,000–250,000 | 222 | $4 | $7 (5m) | $6 | $5 | Go check it out | |
| ERNIE 4.5 VL 424B A47B | - | 123,000 | $0.42 | - | - | $1.25 | Go check it out |
| ERNIE 4.5 300B A47B | - | 123,000 | $0.28 | - | - | $1.1 | Go check it out |
| Qwen3 32B | - | 40,960 | $0.1 | - | - | $0.45 | Go check it out |
| Qwen3 30B A3B | - | 40,960 | $0.09 | - | - | $0.45 | Go check it out |
| Kimi K2 Instruct | - | 131,072 | $0.57 | - | - | $2.3 | Go check it out |
| DeepSeek V3 0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14 | Go check it out |
| test-model-interface-2 | 1–32,000 | 65,000 | $5.1 | $8.1 (5m) | $7.1 | $6.1 | Go check it out |
| $32,000–$128,000 | 65,000 | $5.3 | $8.3 (5m) | $7.3 | $6.3 | Go check it out | |
| 128,000–256,000 | 65,000 | $5.2 | $8.2 (5m) | $7.2 | $6.2 | Go check it out |