Large Model API Pricing

Explore pricing for our model API. Rates are listed per model in USD per million tokens (/Mt), so you can find the option that fits your needs.

Anthropic

Anthropic's Claude models emphasize AI safety and aim to be helpful, harmless, and honest assistants with strong reasoning and conversational abilities.

Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache Read (/Mt) | Output (/Mt)
claude-haiku-4-5-20251001 | 1–2,000 | 20,000 | $5 | $13 (5m) | $12 | $10
claude-haiku-4-5-20251001 | 2,000–10,000 | 20,000 | $3 | $8 (5m) | $8 | $4
claude-sonnet-4-5-20250929 | 1–2,000 | 200,000 | $2 | $4 (5m) · $8 (1h) | $4 | $2
claude-sonnet-4-5-20250929 | 2,000–20,000 | 200,000 | $4 | $6 (5m) · $10 (1h) | $6 | $4
claude-3-7-sonnet-20250219 | - | 200,000 | $3 | $3.75 (5m) | $0.3 | $15
claude-sonnet-4-20250514 | - | 200,000 | $3 | $3.75 (5m) · $6.60 (1h) | $0.3 | $15
claude-opus-4-20250514 | - | 200,000 | $15 | $18.75 (5m) | $1.5 | $75
claude-opus-4-1-20250805 | - | 200,000 | $15 | $18.75 (5m) | $1.5 | $75
claude-3-5-sonnet-20241022 | - | 200,000 | $3 | $3.75 (5m) | $0.3 | $15
claude-3-haiku-20240307 | - | 200,000 | $0.25 | - | - | $1.25
claude-3-5-haiku-20241022 | - | 200,000 | $0.8 | - | - | $4
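
To make the per-million-token rates concrete, the sketch below estimates the cost of a single request using the claude-sonnet-4-20250514 row above. It assumes each token class (uncached input, cache write, cache read, output) is billed independently at its listed rate, which this page does not state explicitly. The names `RATES` and `request_cost` and the usage figures are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# Rates copied from the claude-sonnet-4-20250514 row, in USD per million tokens.
RATES = {
    "input": Decimal("3"),
    "cache_write_5m": Decimal("3.75"),
    "cache_read": Decimal("0.3"),
    "output": Decimal("15"),
}


def request_cost(tokens: dict[str, int], rates: dict[str, Decimal] = RATES) -> Decimal:
    """Sum count * rate / 1,000,000 over every token class in `tokens`.

    Assumption: token classes are billed independently of one another.
    """
    return sum(
        (Decimal(count) * rates[kind] / MILLION for kind, count in tokens.items()),
        Decimal(0),
    )


# Hypothetical request: 10,000 uncached input tokens, 50,000 tokens read
# from cache, and 2,000 output tokens.
usage = {"input": 10_000, "cache_read": 50_000, "output": 2_000}
print(request_cost(usage))
```

Under these assumptions the request costs $0.03 for input, $0.015 for cache reads, and $0.03 for output, or $0.075 in total. `Decimal` is used rather than `float` so that small per-token amounts add up without rounding drift.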

OpenAI

OpenAI's GPT series of models offers state-of-the-art language understanding and generation, delivers strong performance across a wide range of tasks, and is among the industry's leading AI model families.

Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache Read (/Mt) | Output (/Mt)
gpt-5-codex | 400,000 | $1.25 | - | $0.125 | $10
openai/gpt-oss-120b | 131,072 | $0.1 | - | - | $0.5
openai/gpt-oss-20b | 131,072 | $0.05 | - | - | $0.2
gpt-5 | 400,000 | $1.25 | $0.1 (5m) · $0.2 (1h) | $0.125 | $10
gpt-5-mini | 400,000 | $0.25 | - | $0.025 | $2
gpt-5-nano | 400,000 | $0.05 | - | $0.005 | $0.4
gpt-5-pro | 400,000 | $15 | $1 (1h) | - | $120
gpt-5-chat-latest | 400,000 | $1.25 | - | $0.125 | $10
gpt-4.1-mini | 1,047,576 | $0.4 | - | $0.1 | $1.6
gpt-4.1-nano | 1,047,576 | $0.1 | - | $0.025 | $0.4
gpt-4.1 | 1,047,576 | $2 | - | $0.5 | $8
gpt-4o-mini | 131,072 | $0.15 | - | $0.075 | $0.6
gpt-4o | 131,072 | $2.5 | - | $1.25 | $10
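
The gap between the Input and Cache Read columns determines how much caching can lower input spend. As a rough sketch, the blended input rate for a workload is a weighted average of the two columns. The function name `blended_input_rate` and the 80% cache hit ratio are hypothetical, and cache write charges are deliberately left out of this estimate.

```python
def blended_input_rate(input_rate: float, cache_read_rate: float, hit_ratio: float) -> float:
    """Average USD per million input tokens when `hit_ratio` of them are cache reads.

    Assumption: every input token is billed at either the Input rate or the
    Cache Read rate; cache write charges are ignored here.
    """
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    return (1.0 - hit_ratio) * input_rate + hit_ratio * cache_read_rate


# gpt-5 row: $1.25 input and $0.125 cache read per million tokens.
rate = blended_input_rate(1.25, 0.125, hit_ratio=0.8)
print(f"${rate:.3f} per million input tokens")
```

With 80% of input tokens served from cache, the blended rate works out to about $0.35 per million input tokens instead of the listed $1.25.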

Gemini

Google's Gemini models perform well across a wide range of natural language processing tasks and offer strong multimodal capabilities.

Model Name | Context | Input (/Mt) | Cache Write (/Mt) | Cache Read (/Mt) | Output (/Mt)
google/gemma-3-12b-it | 131,072 | $0.05 | - | - | $0.1
gemini-2.5-flash | 1,048,576 | $0.3 | $0.083 (5m) | $0.075 | $2.5
gemini-2.5-pro | 1,048,576 | $1.25 | $0.375 (5m) | $0.3125 | $10
google/gemma-3-27b-it | 32,768 | $0.119 | - | - | $0.2
gemini-3.1-flash-lite-preview | 1,000,000 | $1 | $2 (5m) · $2 (1h) | $1 | $2
gemini-2.5-flash-lite-preview-09-2025 | 1,048,576 | $0.1 | $0.083 (5m) | $0.01 | $0.4
gemini-2.0-flash-lite | 1,048,576 | $0.075 | $0.083 (5m) | $0.0188 | $0.3
gemini-2.5-flash-lite | 1,048,576 | $0.1 | $0.083 (5m) | $0.025 | $0.4
gemini-2.5-flash-lite-preview-06-17 | 1,048,576 | $0.1 | - | - | $0.4
gemini-2.5-flash-preview-05-20 | 1,048,576 | $0.15 | - | - | $3.5
gemini-2.5-pro-preview-06-05 | 1,048,576 | $1.25 | - | - | $10
gemini-2.0-flash-20250609 | 1,048,576 | $0.15 | - | - | $0.6

Llama

Meta's Llama model offers state-of-the-art language understanding capabilities and features an open architecture, making it suitable for a wide range of applications.

Qwen

The Qwen series of models offers powerful natural language processing capabilities and is available in a range of parameter sizes, from lightweight to enterprise-grade solutions.

Baidu

Baidu's ERNIE model offers advanced Chinese language understanding and multimodal capabilities, is optimized for Chinese applications, and is competitively priced.

Model Name | Context | Input (/Mt) | Output (/Mt)
baidu/ernie-4.5-vl-424b-a47b | 123,000 | $0.42 | $1.25
baidu/ernie-4.5-300b-a47b-paddle | 123,000 | $0.28 | $1.1

THUDM

The GLM series of models from Tsinghua University features advanced Chinese language understanding and generation capabilities.

Model Name | Context | Input (/Mt) | Output (/Mt)
zai-org/glm-4.5 | 131,072 | $0.6 | $2.2
zai-org/glm-4.5v | 65,536 | $0.6 | $1.8
thudm/glm-4.1v-9b-thinking | 65,536 | $0.035 | $0.138

Sao10K

Fine-tuned models optimized for creative and role-playing applications, with enhanced storytelling capabilities.

Model Name | Context | Input (/Mt) | Output (/Mt)
sao10k/l3-70b-euryale-v2.1 | 8,192 | $1.48 | $1.48
sao10k/l3-8b-lunaris | 8,192 | $0.05 | $0.05
Sao10K/L3-8B-Stheno-v3.2 | 8,192 | $0.05 | $0.05
sao10k/l31-70b-euryale-v2.2 | 8,192 | $1.48 | $1.48

Mistralai

Powerful and efficient language models from Mistral AI, designed for both commercial and open-source applications.

Model Name | Context | Input (/Mt) | Output (/Mt)
mistralai/mistral-nemo | 60,288 | $0.04 | $0.17
mistralai/mistral-7b-instruct | 32,768 | $0.029 | $0.059

DeepSeek

Advanced AI models from DeepSeek, offering cutting-edge inference capabilities and competitive pricing for enterprise and research applications.

Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache Read (/Mt) | Output (/Mt)
deepseek/deepseek-v3.1 | - | 163,840 | $0.27 | $1 (5m) | $1 | $1
deepseek/deepseek-r1-0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6
deepseek/deepseek-r1-0528 | 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6
deepseek/deepseek-r1-0528 | 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4
deepseek/deepseek-v3-0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14
deepseek/deepseek-v3.1-test | - | 20,000 | Free | - | - | Free
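
For models priced by input token range, the applicable rates depend on the size of the request. The sketch below looks up the tier for deepseek/deepseek-r1-0528 from the rows above. It rests on two assumptions the page does not confirm: the upper bound of each range is inclusive (the listed ranges share their boundary values), and the whole request is billed at the matching tier's rates rather than marginally. `Tier`, `R1_TIERS`, and `select_tier` are hypothetical names.

```python
from typing import NamedTuple


class Tier(NamedTuple):
    max_input_tokens: int  # upper bound of the range, assumed inclusive
    input: float           # USD per million tokens
    cache_write_5m: float
    cache_read: float
    output: float


# deepseek/deepseek-r1-0528 rows, sorted here by upper bound
# (the page lists them in a different order).
R1_TIERS = [
    Tier(32_768, 1.5, 0.6, 0.9, 6.0),
    Tier(131_072, 8.0, 0.5, 0.3, 4.0),
    Tier(204_800, 3.0, 0.7, 0.5, 6.0),
]


def select_tier(input_tokens: int, tiers: list[Tier] = R1_TIERS) -> Tier:
    """Return the first tier whose upper bound covers `input_tokens`."""
    if input_tokens < 1:
        raise ValueError("input_tokens must be at least 1")
    for tier in tiers:
        if input_tokens <= tier.max_input_tokens:
            return tier
    raise ValueError("input_tokens exceeds the largest listed range")


tier = select_tier(40_000)
print(tier.input, tier.output)
```

A 40,000-token request falls in the 32,768–131,072 range, so under these assumptions it would be billed at $8 per million input tokens and $4 per million output tokens.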

MiniMax

MiniMax AI's advanced language model delivers powerful conversational AI capabilities, excelling in customer service, content generation, and creative applications, with robust multilingual support and enterprise-grade scalability.

Model Name | Context | Input (/Mt) | Output (/Mt)
minimaxai/minimax-m1-80k | 1,000,000 | $0.55 | $2.2

Gryphe

An innovative AI model from Gryphe that offers professional-grade language understanding capabilities, with a focus on efficiency and adaptability, making it ideal for niche applications.

Model Name | Context | Input (/Mt) | Output (/Mt)
gryphe/mythomax-l2-13b | 4,096 | $0.09 | $0.09

Mixture of Experts

A sophisticated collection of state-of-the-art AI models, featuring advanced reasoning and mathematical proof capabilities, as well as cutting-edge language understanding across multiple domains.

Model Name | Input Token Range | Context | Input (/Mt) | Cache Write (/Mt) | Cache Read (/Mt) | Output (/Mt)
deepseek/deepseek-v3.1 | - | 163,840 | $0.27 | $1 (5m) | $1 | $1
openai/gpt-oss-120b | - | 131,072 | $0.1 | - | - | $0.5
zai-org/glm-4.5 | - | 131,072 | $0.6 | - | - | $2.2
qwen/qwen3-235b-a22b-thinking-2507 | - | 131,072 | $0.3 | - | - | $3
zai-org/glm-4.5v | - | 65,536 | $0.6 | - | - | $1.8
openai/gpt-oss-20b | - | 131,072 | $0.05 | - | - | $0.2
minimaxai/minimax-m1-80k | - | 1,000,000 | $0.55 | - | - | $2.2
deepseek/deepseek-r1-0528 | 1–32,768 | 163,840 | $1.5 | $0.6 (5m) | $0.9 | $6
deepseek/deepseek-r1-0528 | 131,072–204,800 | 163,840 | $3 | $0.7 (5m) | $0.5 | $6
deepseek/deepseek-r1-0528 | 32,768–131,072 | 163,840 | $8 | $0.5 (5m) | $0.3 | $4
qwen/qwen3-235b-a22b-fp8 | - | 40,960 | $0.2 | - | - | $0.8
meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | - | 1,048,576 | $0.17 | - | - | $0.85
meta-llama/llama-4-scout-17b-16e-instruct | - | 131,072 | $0.1 | - | - | $0.5
Synchronous Interface Testing-222 | 1–200 | 222 | $2 | $5 (5m) | $4 | $3
Synchronous Interface Testing-222 | 200–50,000 | 222 | $3 | $6 (5m) | $5 | $4
Synchronous Interface Testing-222 | 50,000–250,000 | 222 | $4 | $7 (5m) | $6 | $5
baidu/ernie-4.5-vl-424b-a47b | - | 123,000 | $0.42 | - | - | $1.25
baidu/ernie-4.5-300b-a47b-paddle | - | 123,000 | $0.28 | - | - | $1.1
qwen/qwen3-32b-fp8 | - | 40,960 | $0.1 | - | - | $0.45
qwen/qwen3-30b-a3b-fp8 | - | 40,960 | $0.09 | - | - | $0.45
moonshotai/kimi-k2-instruct | - | 131,072 | $0.57 | - | - | $2.3
deepseek/deepseek-v3-0324 | - | 163,840 | $0.28 | $0.14 (5m) | $0.14 | $1.14
test-model-interface-2 | 1–32,000 | 65,000 | $5.1 | $8.1 (5m) | $7.1 | $6.1
test-model-interface-2 | 32,000–128,000 | 65,000 | $5.3 | $8.3 (5m) | $7.3 | $6.3
test-model-interface-2 | 128,000–256,000 | 65,000 | $5.2 | $8.2 (5m) | $7.2 | $6.2