Large Model API Pricing

Explore the pricing for our model API. With transparent rates and flexible options, find the right plan to meet your needs.

Anthropic

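Anthropic's Claude models offer advanced, safe AI capabilities, focusing on a helpful, harmless, and honest AI assistant experience, with strong reasoning and conversational abilities.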

Model Name | Input Token Range | Context | Input (/M tokens) | Cache Write (/M tokens) | Cache Read (/M tokens) | Output (/M tokens)
claude-haiku-4-5-20251001 | 1–2,000 | 20,000 | $5 | $13 (5m) | $12 | $10
claude-haiku-4-5-20251001 | 2,000–10,000 | 20,000 | $3 | $8 (5m) | $8 | $4
claude-sonnet-4-5-20250929 | 1–2,000 | 200,000 | $2 | $4 (5m), $8 (1h) | $4 | $2
claude-sonnet-4-5-20250929 | 2,000–20,000 | 200,000 | $4 | $6 (5m), $10 (1h) | $6 | $4
claude-3-7-sonnet-20250219 | - | 200,000 | $3 | $3.75 (5m) | $0.3 | $15
claude-sonnet-4-20250514 | - | 200,000 | $3 | $3.75 (5m), $6.60 (1h) | $0.3 | $15
claude-opus-4-20250514 | - | 200,000 | $15 | $18.75 (5m) | $1.5 | $75
claude-opus-4-1-20250805 | - | 200,000 | $15 | $18.75 (5m) | $1.5 | $75
claude-3-5-sonnet-20241022 | - | 200,000 | $3 | $3.75 (5m) | $0.3 | $15
claude-3-haiku-20240307 | - | 200,000 | $0.25 | - | - | $1.25
claude-3-5-haiku-20241022 | - | 200,000 | $0.8 | - | - | $4
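
As a rough illustration of how the listed rates translate into a per-request charge, the sketch below assumes that "/Mt" means "per million tokens", that uncached input, cache writes, cache reads, and output are each billed at their own listed rate, and that charges are simply summed. The function name `request_cost` and its parameters are hypothetical and are not part of any published API; actual billing rules may differ.

```python
from decimal import Decimal

# Assumption: the "/Mt" column suffix means "per one million tokens".
TOKENS_PER_UNIT = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, input_rate, output_rate,
                 cache_write_tokens=0, cache_read_tokens=0,
                 cache_write_rate="0", cache_read_rate="0"):
    """Estimate the USD cost of one request (hypothetical helper).

    Token counts are integers; rates are USD per million tokens, passed as
    strings so Decimal can represent them exactly. `input_tokens` is assumed
    to count only uncached input tokens.
    """
    parts = [
        (input_tokens, input_rate),
        (output_tokens, output_rate),
        (cache_write_tokens, cache_write_rate),
        (cache_read_tokens, cache_read_rate),
    ]
    total = sum((Decimal(count) * Decimal(rate) for count, rate in parts),
                Decimal(0))
    return total / TOKENS_PER_UNIT


# Rates taken from the claude-3-7-sonnet-20250219 row:
# input $3, cache write $3.75 (5m), cache read $0.3, output $15.
example_cost = request_cost(
    input_tokens=10_000, output_tokens=2_000,
    input_rate="3", output_rate="15",
    cache_read_tokens=50_000,
    cache_write_rate="3.75", cache_read_rate="0.3",
)
```

Under these assumptions, 10,000 uncached input tokens, 50,000 cache-read tokens, and 2,000 output tokens come to $0.03 + $0.015 + $0.03 = $0.075.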

OpenAI

OpenAI's GPT series of models offer state-of-the-art language understanding and generation capabilities, delivering outstanding performance across a wide range of tasks, and are among the industry's leading AI models.

Model Name | Context | Input (/M tokens) | Cache Write (/M tokens) | Cache Read (/M tokens) | Output (/M tokens)
gpt-5-codex | 400,000 | $1.25 | - | $0.125 | $10
OpenAI GPT OSS 120B | 131,072 | $0.1 | - | - | $0.5
OpenAI: GPT OSS 20B | 131,072 | $0.05 | - | - | $0.2
gpt-5 | 400,000 | $1.25 | $0.1 (5m), $0.2 (1h) | $0.125 | $10
gpt-5-mini | 400,000 | $0.25 | - | $0.025 | $2
gpt-5-nano | 400,000 | $0.05 | - | $0.005 | $0.4
gpt-5-pro | 400,000 | $15 | $1 (1h) | - | $120
gpt-5-chat-latest | 400,000 | $1.25 | - | $0.125 | $10
gpt-4.1-mini | 1,047,576 | $0.4 | - | $0.1 | $1.6
gpt-4.1-nano | 1,047,576 | $0.1 | - | $0.025 | $0.4
gpt-4.1 | 1,047,576 | $2 | - | $0.5 | $8
gpt-4o-mini | 131,072 | $0.15 | - | $0.075 | $0.6
gpt-4o | 131,072 | $2.5 | - | $1.25 | $10

Gemini

Google's Gemini models offer high-quality language processing, perform well across a variety of NLP tasks, and have strong multimodal capabilities.

Model Name | Context | Input (/M tokens) | Cache Write (/M tokens) | Cache Read (/M tokens) | Output (/M tokens)
Gemma3 12B | 131,072 | $0.05 | - | - | $0.1
gemini-2.5-flash | 1,048,576 | $0.3 | $0.083 (5m) | $0.075 | $2.5
gemini-2.5-pro | 1,048,576 | $1.25 | $0.375 (5m) | $0.3125 | $10
Gemma 3 27B | 32,768 | $0.119 | - | - | $0.2
gemini-3.1-flash-lite-preview | 1,000,000 | $1 | $2 (5m), $2 (1h) | $1 | $2
gemini-2.5-flash-lite-preview-09-2025 | 1,048,576 | $0.1 | $0.083 (5m) | $0.01 | $0.4
gemini-2.0-flash-lite | 1,048,576 | $0.075 | $0.083 (5m) | $0.0188 | $0.3
gemini-2.5-flash-lite | 1,048,576 | $0.1 | $0.083 (5m) | $0.025 | $0.4
gemini-2.5-flash-lite-preview-06-17 | 1,048,576 | $0.1 | - | - | $0.4
gemini-2.5-flash-preview-05-20 | 1,048,576 | $0.15 | - | - | $3.5
gemini-2.5-pro-preview-06-05 | 1,048,576 | $1.25 | - | - | $10
gemini-2.0-flash-20250609 | 1,048,576 | $0.15 | - | - | $0.6

Llama

Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.

Model Name | Context | Input (/M tokens) | Output (/M tokens)
Llama 3.1 8B Instruct | 16,384 | $0.02 | $0.05
Llama 3.3 70B Instruct | 131,072 | $0.13 | $0.39
Llama 4 Maverick Instruct | 1,048,576 | $0.17 | $0.85
Llama 4 Scout Instruct | 131,072 | $0.1 | $0.5
Llama 3.2 3B Instruct | 32,768 | $0.03 | $0.05

Qwen

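The Qwen series of models offers efficient language processing in a range of parameter sizes, covering solutions from lightweight to enterprise-grade.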

Wenxin

Baidu

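Baidu's ERNIE models offer advanced Chinese language understanding and multimodal capabilities, are optimized for Chinese-language applications, and are competitively priced.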

Model Name | Context | Input (/M tokens) | Output (/M tokens)
ERNIE 4.5 VL 424B A47B | 123,000 | $0.42 | $1.25
ERNIE 4.5 300B A47B | 123,000 | $0.28 | $1.1
ChatGLM

THUDM

The GLM series of models from Tsinghua University offers advanced Chinese language understanding and generation capabilities.

Model Name | Context | Input (/M tokens) | Output (/M tokens)
GLM-4.5 | 131,072 | $0.6 | $2.2
GLM 4.5V | 65,536 | $0.6 | $1.8
GLM 4.1V 9B Thinking | 65,536 | $0.035 | $0.138

Sao10K

A fine-tuned model specifically optimized for creative and role-playing applications, featuring enhanced storytelling capabilities.

Model Name | Context | Input (/M tokens) | Output (/M tokens)
L3 70B Euryale V2.1 | 8,192 | $1.48 | $1.48
Sao10k L3 8B Lunaris | 8,192 | $0.05 | $0.05
L3 8B Stheno V3.2 | 8,192 | $0.05 | $0.05
L31 70B Euryale V2.2 | 8,192 | $1.48 | $1.48

Mistralai

A powerful and efficient language model from Mistral AI, designed for both commercial and open-source applications.

Model Name | Context | Input (/M tokens) | Output (/M tokens)
Mistral Nemo | 60,288 | $0.04 | $0.17
Mistral 7B Instruct | 32,768 | $0.029 | $0.059

Deepseek

Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.

Model Name | Input Token Range | Context | Input (/M tokens) | Cache Write (/M tokens) | Cache Read (/M tokens) | Output (/M tokens)
deepseek/deepseek-v3.1-test | - | 20,000 | Free | - | - | Free
DeepSeek V3.1 | - | 163,840 | $20 | $1 (5m) | $1 | $100
DeepSeek R1 0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6
DeepSeek R1 0528 | 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6
DeepSeek R1 0528 | 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4
DeepSeek V3 0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14
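
Some models above list different rates per input token range. The page does not say how a tier is chosen, so the following is only a sketch under stated assumptions: the tier is selected by the request's input token count, and a count that sits exactly on a shared boundary (for example 32,768) falls into the lower tier. The `Tier` type and `select_tier` function are hypothetical names used for illustration; the rates are copied from the DeepSeek R1 0528 rows.

```python
from typing import NamedTuple, Sequence


class Tier(NamedTuple):
    low: int            # lower bound of the input token range
    high: int           # upper bound (assumed inclusive)
    input_rate: float   # USD per million input tokens
    output_rate: float  # USD per million output tokens


# DeepSeek R1 0528 tiers as listed in the table above.
R1_TIERS = [
    Tier(1, 32_768, 1.5, 6.0),
    Tier(131_072, 204_800, 3.0, 6.0),
    Tier(32_768, 131_072, 8.0, 4.0),
]


def select_tier(input_tokens: int, tiers: Sequence[Tier]) -> Tier:
    """Return the first tier, in ascending order, whose range contains the count.

    Scanning in ascending order means a count on a shared boundary resolves
    to the lower tier (an assumption, not documented behavior).
    """
    for tier in sorted(tiers, key=lambda t: t.low):
        if tier.low <= input_tokens <= tier.high:
            return tier
    raise ValueError(f"no pricing tier covers {input_tokens} input tokens")


tier = select_tier(40_000, R1_TIERS)
```

Here a 40,000-token prompt resolves to the 32,768–131,072 tier. Sorting inside the function keeps the lookup independent of the order in which the rows happen to be listed.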

MiniMax

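MiniMax AI's advanced language models offer strong conversational AI capabilities, excel in customer service, content generation, and creative applications, and provide robust multilingual support and enterprise-grade scalability.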

Model Name | Context | Input (/M tokens) | Output (/M tokens)
MiniMax M1 | 1,000,000 | $0.55 | $2.2

Gryphe

An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.

Model Name | Context | Input (/M tokens) | Output (/M tokens)
Mythomax L2 13B | 4,096 | $0.09 | $0.09

Mixture of Experts

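A curated collection of state-of-the-art AI models with advanced reasoning, mathematical proof capabilities, and cutting-edge language understanding across multiple domains.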

Model Name | Input Token Range | Context | Input (/M tokens) | Cache Write (/M tokens) | Cache Read (/M tokens) | Output (/M tokens)
DeepSeek V3.1 | - | 163,840 | $20 | $1 (5m) | $1 | $100
OpenAI GPT OSS 120B | - | 131,072 | $0.1 | - | - | $0.5
GLM-4.5 | - | 131,072 | $0.6 | - | - | $2.2
Qwen3 235B A22b Thinking 2507 | - | 131,072 | $0.3 | - | - | $3
GLM 4.5V | - | 65,536 | $0.6 | - | - | $1.8
OpenAI: GPT OSS 20B | - | 131,072 | $0.05 | - | - | $0.2
MiniMax M1 | - | 1,000,000 | $0.55 | - | - | $2.2
DeepSeek R1 0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6
DeepSeek R1 0528 | 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6
DeepSeek R1 0528 | 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4
Qwen3 235B A22B | - | 40,960 | $0.2 | - | - | $0.8
Llama 4 Maverick Instruct | - | 1,048,576 | $0.17 | - | - | $0.85
Llama 4 Scout Instruct | - | 131,072 | $0.1 | - | - | $0.5
222 | 1–200 | 222 | $2 | $5 (5m) | $4 | $3
222 | 200–50,000 | 222 | $3 | $6 (5m) | $5 | $4
222 | 50,000–250,000 | 222 | $4 | $7 (5m) | $6 | $5
ERNIE 4.5 VL 424B A47B | - | 123,000 | $0.42 | - | - | $1.25
ERNIE 4.5 300B A47B | - | 123,000 | $0.28 | - | - | $1.1
Qwen3 32B | - | 40,960 | $0.1 | - | - | $0.45
Qwen3 30B A3B | - | 40,960 | $0.09 | - | - | $0.45
Kimi K2 Instruct | - | 131,072 | $0.57 | - | - | $2.3
DeepSeek V3 0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14
test-model-interface-2 | 1–32,000 | 65,000 | $5.1 | $8.1 (5m) | $7.1 | $6.1
test-model-interface-2 | 32,000–128,000 | 65,000 | $5.3 | $8.3 (5m) | $7.3 | $6.3
test-model-interface-2 | 128,000–256,000 | 65,000 | $5.2 | $8.2 (5m) | $7.2 | $6.2