Model Marketplace

One Unified API

Compare and access top AI models from OpenAI, Anthropic, Google, BytePlus, Kuaishou and more — all in one place.

Provider

Showing 107 models

Seedance-2.0

BytePlusVideo

NEW

Seedance 2.0 is BytePlus new-generation video model featuring superior visual fidelity, longer duration support, and advanced scene understanding for professional-grade content creation.

Pricing

$7.0 /M tokens (480p/720p, without video)
$4.3 /M tokens (480p/720p, with video)
$7.7 /M tokens (1080p, without video)
$4.7 /M tokens (1080p, with video)
$4.0 /M tokens (4k, without video)
$2.4 /M tokens (4k, with video)

dreamina-seedance-2-0-260128

Seedance-2.0-enhance

BytePlusVideo

NEW

Seedance 2.0 is BytePlus new-generation video model featuring superior visual fidelity, longer duration support, and advanced scene understanding for professional-grade content creation.

Pricing

$7.0 /M tokens (480p/720p, without video)
$4.3 /M tokens (480p/720p, with video)
$7.7 /M tokens (1080p, without video)
$4.7 /M tokens (1080p, with video)
$4.0 /M tokens (4k, without video)
$2.4 /M tokens (4k, with video)

dreamina-seedance-2-0-260128-enhance

Seedance-2.0-mini

BytePlusVideo

NEW

Pricing varies based on whether the input includes video. 1080p output is not supported.

Pricing

$3.5 /M tokens (without video)
$2.1 /M tokens (with video)

dreamina-seedance-2-0-mini-260615

Seedance-2.0-fast

BytePlusVideo

NEW

Seedance 2.0 Fast is BytePlus fast-response video model built for lower-latency generation, delivering strong visual quality and efficient turnaround for high-throughput creative workflows.

Pricing

$5.6 /M tokens (without video)
$3.3 /M tokens (with video)

dreamina-seedance-2-0-fast-260128

Seedance-2.0-fast-enhance

BytePlusVideo

NEW

Seedance 2.0 Fast is BytePlus fast-response video model built for lower-latency generation, delivering strong visual quality and efficient turnaround for high-throughput creative workflows.

Pricing

$5.6 /M tokens (without video)
$3.3 /M tokens (with video)

dreamina-seedance-2-0-fast-260128-enhance

Seedance-1.5-pro

BytePlusVideo

Seedance 1.5 is the latest video generation model launched by ByteDance🚀. Based on the functionalities Seedance 1.0 Pro functionalities, Seedance Pro 1.5 can automatically generate matching voices, sound effects, and background music based on text prompts and visual content.

Pricing

$0.0024 /K tokens(w/ audio)
$0.0012 /K tokens(w/o audio)

seedance-1.5-pro

Seedance-1.0-pro

BytePlusVideo

Seedance 1.0 Pro, the large-parameter version of the model suite, features a unique multi-shot storytelling capability and demonstrates outstanding performance across multiple dimensions. It achieves breakthroughs in semantic understanding and instruction-following, enabling the generation of 1080P high-definition videos with smooth motion, rich details, diverse styles, and cinematic visual quality.

Pricing

$0.0025 /K tokens

seedance-1.0-pro-25028

Seedance-1.0-pro-fast

BytePlusVideo

Byteplus Seedance 1.0 Professional version, optimized for text-to-video conversion with high-quality video generation capabilities. Supports multiple style transformations, ideal for rapid prototyping and creative exploration. Lightweight design ensures fast response and efficient processing.

Pricing

$0.0010 /K tokens

20% OFF

seedance-1.0-pro-fast-251015

Seedance-1.0-lite-i2v

BytePlusVideo

Byteplus Seedance 1.0 Lite version, optimized for image-to-video conversion with high-quality video generation capabilities. Supports multiple style transformations, ideal for rapid prototyping and creative exploration. Lightweight design ensures fast response and efficient processing.

Pricing

$0.0018 /K tokens

byteplus-seedance-1.0-lite-i2v

Seedream-5.0-pro

BytePlusImage

BytePlus Seedream 5.0 Pro is the flagship image generation model featuring superior detail rendering, stronger aesthetics, and improved prompt adherence for professional-grade visual creations.

Pricing

Input (first image) - Free
Input (from the 2nd image) - $0.003 /pcs
Output (≤ 2.36M px) - $0.045 /pcs
Output (> 2.36M px) - $0.09 /pcs

doubao-seedream-5-0-pro-260628

Seedream-5.0-lite

BytePlusImage

BytePlus Seedream 5.0 Lite is the latest image generation model featuring enhanced detail rendering, up to 3K/4K resolution output, and improved prompt adherence for stunning visual creations.

Pricing

$0.0350 /pcs

byteplus-seedream-5.0-lite

Seedream-4.5

BytePlusImage

HOT

BytePlus Seedream 4.5 is a New High-Aesthetic Image Generation Model with stronger spatial understanding, richer world knowledge, superior aesthetics, higher consistency, and smarter instruction following for precise visual creation.

Pricing

$0.0400 /pcs

byteplus-seedream-4.5

Seedream-4.0

BytePlusImage

Byteplus Seedream 4.0 is a SOTA multimodal image generation model, enabling text-to-image creation, image editing and multi-image generation within a single model, supports a wide range of creative scenarios.

Pricing

$0.0300 /pcs

byteplus-seedream-4.0

Qwen3.7-Max

AliCloudText

NEW

Alibaba Cloud Qwen3.7-Max is the flagship Qwen text model, delivering strong general reasoning, instruction-following and tool use, with built-in web search support.

Pricing

Input - $2.5 /M tokens
Input (cache hit) - $0.5 /M tokens
Explicit cache write - $3.125 /M tokens
Explicit cache hit - $0.25 /M tokens
Output - $7.5 /M tokens

qwen3.7-max

Qwen3.7-Plus

AliCloudText

NEW

Alibaba Cloud Qwen3.7-Plus is the cost-effective Qwen3.7 model, pairing strong text capabilities with upgraded vision-language understanding and full-stack agent intelligence for coding, tool use and multimodal interaction, with a 1M context window.

Pricing

Input - $0.4 /M tokens (<=256K)
Input (cache hit) - $0.08 /M tokens (<=256K)
Explicit cache write - $0.5 /M tokens (<=256K)
Explicit cache hit - $0.04 /M tokens (<=256K)
Output - $1.6 /M tokens (<=256K)
Input - $1.2 /M tokens (256K~1M)
Input (cache hit) - $0.24 /M tokens (256K~1M)
Explicit cache write - $1.5 /M tokens (256K~1M)
Explicit cache hit - $0.12 /M tokens (256K~1M)
Output - $4.8 /M tokens (256K~1M)

qwen3.7-plus

Wan-2.6

AliCloudVideo

Alibaba Cloud Wan 2.6 is an advanced video generation model that creates high-quality videos from text descriptions, supporting multiple resolutions including 720p and 1080p.

Pricing

T2V 720p - $0.10 /seconds
T2V 1080p - $0.15 /seconds
I2V 720p(w/ audio) - $0.10 /seconds
I2V 1080p(w/ audio) - $0.15 /seconds
R2V 720p(w/ audio) - $0.10 /seconds
R2V 1080p(w/ audio) - $0.15 /seconds
R2V-Flash 720p(w/o audio) - $0.025 /seconds
R2V-Flash 1080p(w/o audio) - $0.0375 /seconds
R2V-Flash 720p(w/ audio) - $0.05 /seconds
R2V-Flash 1080p(w/ audio) - $0.075 /seconds

alicloud-wan-2.6

Wan-2.5-preview

AliCloudVideo

Alibaba Cloud Wan 2.5 preview is a powerful video generation model that transforms text prompts into dynamic videos, supporting resolutions from 480p to 1080p.

Pricing

T2V 480p - $0.05 /seconds
T2V 720p - $0.10 /seconds
T2V 1080p - $0.15 /seconds
I2V 480p(w/ audio) - $0.05 /seconds
I2V 720p(w/ audio) - $0.10 /seconds
I2V 1080p(w/ audio) - $0.15 /seconds

alicloud-wan-2.5

qwen-image-2.0-pro

AliCloudImage

NEW

Alibaba Cloud Qwen Image 2.0 Pro unified image generation and editing model with stronger text rendering, realism, and prompt adherence.

Pricing

$0.071429 /pcs

qwen-image-2.0-pro

qwen-image-2.0

AliCloudImage

Alibaba Cloud Qwen Image 2.0 accelerated image generation and editing model balancing quality and response time.

Pricing

$0.028600 /pcs

qwen-image-2.0

gemini-2.5-flash

GoogleText

Google Gemini 2.5 Flash model, a fast and efficient language model optimized for speed and cost-effectiveness.

Pricing

Input - $0.30 /M tokens
Cached input - $0.03 /M tokens
Output - $2.50 /M tokens

gemini-2.5-flash

gemini-2.5-flash-lite

GoogleText

Google Gemini 2.5 Flash model, a fast and efficient language model optimized for speed and cost-effectiveness.

Pricing

Input - $0.10 /M tokens
Cached input - $0.01 /M tokens
Output - $0.40 /M tokens

gemini-2.5-flash-lite

gemini-2.5-pro

GoogleText

Google Gemini 2.5 Pro model, offering advanced capabilities with enhanced reasoning and understanding.

Pricing

Input - $1.25 /M tokens <= 200k Tokens
Cached input - $0.125 /M tokens <= 200k Tokens
Input - $2.50 /M tokens > 200k Tokens
Output - $10.00 /M tokens <= 200k Tokens
Output - $15.00 /M tokens > 200k Tokens

gemini-2.5-pro

gemini-3-flash-preview

GoogleText

Google Gemini 2.5 Flash model, a fast and efficient language model optimized for speed and cost-effectiveness.

Pricing

Input - $0.50 /M tokens
Cached input - $0.05 /M tokens
Output - $3.00 /M tokens

gemini-3-flash-preview

gemini-3.1-pro-preview

GoogleText

NEW

Google Gemini 3.1 Pro Preview model, the latest preview version with cutting-edge capabilities.

Pricing

Input - $2.00 /M tokens (prompts <= 200k)
Input - $4.00 /M tokens (prompts > 200k)
Output - $12.00 /M tokens (prompts <= 200k)
Output - $18.00 /M tokens (prompts > 200k)
Cached input - $0.20 /M tokens (prompts <= 200k)
Cached input - $0.40 /M tokens (prompts > 200k)

gemini-3.1-pro-preview

gemini-3.1-flash-lite-preview

GoogleText

NEW

Google Gemini 3.1 Flash-Lite Preview model, optimized for high-throughput lightweight tasks with low latency and cost efficiency.

Pricing

Input (Text/Image/Video) - $0.25 /M tokens
Input (Audio) - $0.50 /M tokens
Output (incl. thinking tokens) - $1.50 /M tokens
Cached input (Text/Image/Video) - $0.025 /M tokens
Cached input (Audio) - $0.05 /M tokens
Cache storage - $1.00 /hour /1M tokens

gemini-3.1-flash-lite-preview

gemini-3.5-flash

GoogleText

NEW

Google Gemini 3.5 Flash model, optimized for fast multimodal reasoning, search-grounded responses, and cost-efficient high-throughput workloads.

Pricing

Input - $1.50 /M tokens
Output (incl. thinking tokens) - $9.00 /M tokens
Cached input - $0.15 /M tokens
Cache storage - $1.00 /hour /1M tokens

gemini-3.5-flash

Veo-3.1-generate-preview

GoogleVideo

NEW

Google's latest Veo 3.1 video generation model. Generate high-quality videos with synchronized speech/sound effects from a text prompt or reference image

Pricing

720p - $0.40 /seconds
1080p - $0.40 /seconds
4K - $0.60 /seconds

veo-3.1-generate-preview

Veo-3.1-fast-generate-preview

GoogleVideo

NEW

Google's latest Veo 3.1 video generation model. Generate videos with synchronized speech/sound effects from a text prompt or reference image faster

Pricing

720p - $0.15 /seconds
1080p - $0.15 /seconds
4K - $0.35 /seconds

veo-3.1-fast-generate-preview

Gemini-3.1-flash-lite-image

GoogleImage

NEW

(Nano Banana 2🍌) Google's lightweight native image generation model built on Gemini 3.1 Flash Lite. Optimized for maximum speed and cost efficiency in high-throughput scenarios while delivering high-quality, photorealistic imagery.

Pricing

Input - $0.25 /M tokens
Output - $1.50 /M tokens
Output (image) - $30.00 /M tokens
1K(1024x1024px) - about $0.0336 /pcs

gemini-3.1-flash-lite-image

Gemini-2.5-flash-image

GoogleImage

NEW

(Nano Banana🍌) Google Gemini 2.5 Flash image model supporting multimodal text and image input/output. Can be used via OpenAI-compatible interface by adding modalities parameter. Lightweight and fast, ideal for real-time application scenarios requiring image understanding and generation.

Pricing

$0.039 /pcs

gemini-2.5-flash-image

Gemini-3-pro-image-preview

GoogleImage

NEW

(Nano Banana Pro🍌) Google Gemini 3 Pro image generation model with advanced multimodal capabilities. Features state-of-the-art image synthesis, enhanced prompt understanding, and superior visual quality. Supports creative image generation, editing, and style transfer with exceptional detail and accuracy.

Pricing

1K-2K(1024x1024px-2048x2048px) - $0.134 /pcs
4K(4096x4096px) - $0.24 /pcs

gemini-3-pro-image-preview

Gemini-3.1-flash-image-preview

GoogleImage

NEW

(Nano Banana 2🍌) is the latest state-of-the-art image model. Dramatically closes the gap between speed and visual fidelity, delivering high-quality, photorealistic imagery.

Pricing

512px -$0.045 /pcs
1K - $0.067 /pcs
2K - $0.101 /pcs
4K - $0.151 /pcs

gemini-3.1-flash-image-preview

Gemini-3-pro-image

GoogleImage

NEW

(Nano Banana Pro🍌) Google's professional native image generation model built on Gemini 3 Pro with a reasoning core, delivering studio-grade 4K visuals, complex layouts and precise text rendering. Text input/output pricing aligned with Gemini 3.1 Pro.

Pricing

Input - $2.00 /M tokens
Output - $12.00 /M tokens
Output (image) - $120.00 /M tokens

gemini-3-pro-image

Gemini-3.1-flash-image

GoogleImage

NEW

(Nano Banana 2🍌) Google's high-efficiency native image generation model built on Gemini 3.1 Flash. Purpose-built for speed and high-throughput scenarios while delivering high-quality, photorealistic imagery.

Pricing

Input - $0.50 /M tokens
Output - $3.00 /M tokens
Output (image) - $60.00 /M tokens

gemini-3.1-flash-image

gpt-image-2

OpenAIImage

NEW

OpenAI GPT Image 2 is the latest multimodal image generation and editing model, supporting both text and image inputs with high-fidelity image outputs. Delivers improved prompt understanding, better instruction following, and enhanced quality for creative, design, and production scenarios.

Pricing

Image Input - $8.00 /M tokens
Image Cached input - $2.00 /M tokens
Image Output - $30.00 /M tokens
Text Input - $5.00 /M tokens
Text Cached input - $1.25 /M tokens

gpt-image-2

Sora 2

OpenAIVideo

OpenAI's latest Sora 2 video generation model with significantly improved video quality and duration. Standard version offers an affordable entry point for developers and small businesses. Supports more complex scene understanding, more natural physics simulation, and more precise text-to-video conversion. Can generate up to 60 seconds of HD video, perfect for professional content creation.

Pricing

Portrait 720×1280 - $0.10 /seconds
Landscape 1280×720 - $0.10 /seconds

openai-sora-2

Sora 2 Pro

OpenAIVideo

HOT

OpenAI's advanced Sora 2 Pro is a high-fidelity AI video generation model designed for professional content creators and commercial applications. Features significantly improved video quality, extended duration support, and enhanced scene understanding. Supports multiple aspect ratios (portrait and landscape) with customizable resolution and duration settings. Delivers superior video quality with more natural physics simulation and precise text-to-video conversion. Ideal for brand advertising, product showcases, educational content, and creative productions requiring professional-grade video output.

Pricing

Portrait 720×1280 - $0.30 /seconds
Landscape 1280×720 - $0.30 /seconds
HD Portrait 1024×1792 - $0.50 /seconds
HD Landscape 1792×1024 - $0.50 /seconds

openai-sora-2-pro

Grok-4.5

XAIText

NEW

xAI's latest frontier model with advanced reasoning and agentic capabilities, featuring a 500k token context window.

Pricing

Input - $2.00 /M tokens
Cached input - $0.50 /M tokens
Output - $6.00 /M tokens

grok-4.5

Grok-4.3

XAIText

NEW

Agentic tool calling, minimal hallucinations, non-reasoning mode with a 1 million token context window.

Pricing

Input - $1.25 /M tokens
Cached input - $0.20 /M tokens
Output - $2.50 /M tokens

grok-4.3

Kling

KlingVideo

Kling AI video generation model, optimized for content creation. Supports text-to-video, image-to-video, and multiple generation modes. Features excellent character motion understanding and scene coherence, ideal for short video creation, social media content, and marketing video production.

kling-kling

Kling-O1-video

KlingVideo

Kling O1 Video is an advanced video generation model featuring the Omni-Video capability. Supports text-to-video, image-to-video, and video editing with reference images. Offers multiple durations (5-10s), resolutions up to 1080p, and various aspect ratios. Ideal for professional video content creation.

Pricing

O1 std(w/o ref video) - $0.0857 /seconds
O1 std(w/ ref video) - $0.1286 /seconds
O1 pro(w/o ref video) - $0.1143 /seconds
O1 pro(w/ ref video) - $0.1714 /seconds

kling-o1-video

Kling-O1-image

KlingImage

Kling O1 Image is a high-quality AI image generation model. Supports text-to-image with multiple aspect ratios, resolutions (1K/2K), and batch generation up to 9 images. Features excellent prompt understanding and photorealistic output quality for creative and commercial applications.

Pricing

O1 image - $0.0286 /pcs

kling-o1-image

Kling-omni-v3

KlingImage

NEW

Kling AI platform providing Omni-Image and Omni-Video generation with multi-subject composition, reference assets, and high-quality media creation.

Pricing

v3 Omni-image 1K/2K - $0.0286 /pcs
v3 Omni-image 4K - $0.0571 /pcs
v3 Omni-video std(w/o ref, no audio) - $0.0857 /seconds
v3 Omni-video std(w/o ref, w/ audio) - $0.1143 /seconds
v3 Omni-video std(w/ ref, no audio) - $0.1286 /seconds
v3 Omni-video std(w/ ref, w/ audio) - $0.1286 /seconds
v3 Omni-video pro(w/o ref, no audio) - $0.1143 /seconds
v3 Omni-video pro(w/o ref, w/ audio) - $0.1429 /seconds
v3 Omni-video pro(w/ ref, no audio) - $0.1714 /seconds
v3 Omni-video pro(w/ ref, w/ audio) - $0.1714 /seconds

kling

viduq3-pro

ViduVideo

NEW

Vidu Q3 Pro is the highest-quality model in the Q3 series, supporting text-to-video, image-to-video, and start-end-to-video with audio-visual synchronization and multi-shot segmentation. Duration 1-16s with resolutions up to 1080P.

Pricing

540P - $0.045 /seconds
720P - $0.10 /seconds
1080P - $0.12 /seconds

Off-peak hours are charged at half price. Enabling audio for image-to-video adds 15 credits ($0.075) per task.

viduq3-pro

viduq3-turbo

ViduVideo

NEW

Vidu Q3 Turbo is optimized for faster generation, supporting text-to-video, image-to-video, start-end-to-video, and reference-to-video. Duration 1-16s with resolutions up to 1080P, ideal for quick iterations.

Pricing

540P - $0.035 /seconds
720P - $0.055 /seconds
1080P - $0.065 /seconds
Reference-to-Video 540P - $0.02 /seconds
Reference-to-Video 720P - $0.05 /seconds
Reference-to-Video 1080P - $0.065 /seconds

Off-peak hours are charged at half price. Enabling audio for reference-to-video adds 15 credits ($0.075) per task.

viduq3-turbo

viduq3-mix

ViduVideo

NEW

Vidu Q3 Mix balances quality and speed for reference-to-video generation in mixed creative scenarios. Duration 3-16s with resolutions up to 1080P.

Pricing

Reference-to-Video 720P - $0.12 /seconds
Reference-to-Video 1080P - $0.145 /seconds

Off-peak discount is not supported. Enabling audio adds 15 credits ($0.075) per task.

viduq3-mix

Hailuo

MiniMaxVideo

MiniMax Hailuo AI video generation model, renowned for natural fluid motion and precise scene understanding. Supports various video styles from realistic to cartoon, static to dynamic. Built-in intelligent scene analysis automatically optimizes video pacing and transition effects.

minimax-hailuo

MiniMax-M2.5

MiniMaxText

NEW

MiniMax-M2.5 is a frontier text model from MiniMax, engineered for agent workflows, coding, reasoning, and complex long-context tasks.

Pricing

Input - $0.30 /M tokens
Cached input - $0.03 /M tokens
Output - $1.20 /M tokens

MiniMax-M2.5

MiniMax-M3

MiniMaxText

NEW

MiniMax-M3 is the latest frontier model from MiniMax, built for agent workflows, coding, reasoning, and complex long-context tasks with up to 1M-token context.

Pricing

Context ≤ 512K Input - $0.30 /M tokens
Context ≤ 512K Output - $1.20 /M tokens
Context ≤ 512K Cache read - $0.06 /M tokens
Context 512K ~ 1M Input - $0.60 /M tokens
Context 512K ~ 1M Output - $2.40 /M tokens
Context 512K ~ 1M Cache read - $0.12 /M tokens

MiniMax-M3

GLM-5.2

ZAIText

NEW

GLM-5.2 is Zhipu's flagship model for long-horizon coding and agentic tasks, featuring a usable 1M context window.

Pricing

Input - $1.4 /M tokens
Cached Input - $0.26 /M tokens
Output - $4.4 /M tokens

glm-5.2

GLM-5.1

ZAIText

NEW

GLM-5.1 is a Zhipu GLM-5 series model for coding and agentic tasks, supporting long-context chat completion.

Pricing

Input - $1.4 /M tokens
Cached Input - $0.26 /M tokens
Output - $4.4 /M tokens

glm-5.1

GLM-4.6v

ZAIText

NEW

GLM-4.6v is a cost-effective GLM model supporting chat completion.

Pricing

Input - $0.3 /M tokens
Cached Input - $0.05 /M tokens
Output - $0.9 /M tokens

glm-4.6v

GLM

ZAIText

GLM is Zhipu's flagship model line for complex dialogue and agent tasks. It supports thinking mode, native tool calling, and MCP integration, with up to 128K context for long conversations and multi-step reasoning workflows.

Pricing

Input [0, 32k) - $0.28 /M tokens
Output [0, 0.2k) - $1.12 /M tokens
Input [0, 32k) - $0.42 /M tokens
Output [0.2k, ∞) - $1.96 /M tokens
Input [32k, 200k) - $0.56 /M tokens
Output [0.2k, ∞) - $2.24 /M tokens
Cached input - $0.11 /M tokens

glm-4.7

DeepSeek-V4-Flash

DeepSeekText

NEW

DeepSeek-V4-Flash supports both non-thinking and thinking modes. Context window: 1M. Max output: 384K. Fast and cost-effective for daily tasks and agent workflows.

Pricing

Input (cache hit) - $0.0028 /M tokens
Input (cache miss) - $0.14 /M tokens
Output - $0.28 /M tokens

deepseek-v4-flash

DeepSeek-V4-Pro

DeepSeekText

NEW

DeepSeek-V4-Pro is the most powerful DeepSeek model with advanced reasoning capabilities. Context window: 1M. Max output: 384K. Ideal for complex reasoning and agent tasks.

Pricing

Input (cache hit) - $0.0145 /M tokens
Input (cache miss) - $1.74 /M tokens
Output - $3.48 /M tokens

deepseek-v4-pro

DeepSeek-V3.2 (chat)

DeepSeekText

DeepSeek-V3.2 non-thinking mode for fast chat and standard tool usage. Context window: 128K. Output: default 4K, max 8K.

Pricing

Input (cache hit) - $0.028 /M tokens
Input (cache miss) - $0.28 /M tokens
Output - $0.42 /M tokens

deepseek-v3

deepseek-reasoner

DeepSeekText

DeepSeek-V3.2 thinking mode (deepseek-reasoner) for complex reasoning and agent tasks. Context window: 128K. Output: default 32K, max 64K.

Pricing

Input (cache hit) - $0.028 /M tokens
Input (cache miss) - $0.28 /M tokens
Output - $0.42 /M tokens

deepseek-r1

Doubao-Seed-1.6-thinking

DoubaoText

Doubao Seed 1.6 Thinking version, optimized for analytical tasks. Features powerful logical reasoning and problem decomposition capabilities, displaying detailed thought processes. Ideal for education, academic research, and business analysis requiring transparent reasoning.

Pricing

Input [0,32K] - CNY 0.80 /M tokens
Output [0,32K] - CNY 8.00 /M tokens
Input (32,128K] - CNY 1.20 /M tokens
Output (32,128K] - CNY 16.00 /M tokens
Input (128,256K] - CNY 2.40 /M tokens
Output (128,256K] - CNY 24.00 /M tokens
Cache storage - CNY 0.017 /M tokens/hour
Cache input - CNY 0.16 /M tokens

doubao-seed-1.6-thinking

Seed-1.8

BytePlusText

Seed 1.8 general-purpose version by ByteDance Seed team, a balanced language model. Performs well across dialogue, writing, translation, and summarization tasks. Deeply optimized for Chinese scenarios, understanding Chinese context and cultural background, suitable for daily use.

Pricing

Input(0-128k) - $0.25 /M tokens
Input(128k-256k) - $0.50 /M tokens
Cache Hit - $0.05 /M tokens
Output(0-128k) - $2.00 /M tokens
Output(128k-256k) - $4.00 /M tokens
Cache Storage - $0.0083 /M tokens/hour

seed-1.8

Seed-1.8-251228

BytePlusText

Pinned Seed-1.8 snapshot dated 251228, a BytePlus language model optimized for Chinese context and general-purpose tasks.

Pricing

Prompt [0,128K] - $0.25 /M tokens
Cache-hit - $0.05 /M tokens
Output - $2.00 /M tokens

seed-1-8-251228

Doubao-Seed-1.8-251228

DoubaoText

Pinned Doubao Seed-1.8 snapshot dated 251228. ByteDance's Doubao version of the Seed-1.8 model for Chinese dialogue and text generation tasks.

Pricing

Prompt [0,128K] - $0.25 /M tokens
Cache-hit - $0.05 /M tokens
Output - $2.00 /M tokens

doubao-seed-1-8-251228

claude-opus-5

AnthropicText

NEW

Claude Opus 5 is Anthropic's most intelligent model to date, with state-of-the-art performance on complex tasks. It features enhanced reasoning, coding, and agentic capabilities, and supports extended thinking for deeper problem-solving.

Pricing

Input - $5.00 /M tokens
Cache writes(5m) - $6.25 /M tokens
Cache writes(1h) - $10.00 /M tokens
Cache hits & refreshes - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-5

claude-sonnet-5

AnthropicText

NEW

Claude Sonnet 5 is Anthropic's latest flagship model, achieving world-leading performance in AI agents, programming, and computer usage. Features enhanced knowledge base and exceptional long-text processing capabilities. Particularly suitable for complex long-term tasks, code review, and technical documentation. Excels in accuracy and attention to detail.

Pricing

Input - $2.00 /M tokens
Cache writes(5m) - $2.50 /M tokens
Cache writes(1h) - $4.00 /M tokens
Cache hits & refreshes - $0.20 /M tokens
Output - $10.00 /M tokens

claude-sonnet-5

claude-fable-5

AnthropicText

NEW

Claude Fable 5 is Anthropic's most capable widely-released model, designed for the most demanding reasoning and long-horizon agentic work. Features a 1M-token context window, up to 128k output tokens, always-on adaptive thinking, and vision support. Released June 9, 2026.

Pricing

Input - $10.00 /M tokens
Cache writes(5m) - $12.50 /M tokens
Cache writes(1h) - $20.00 /M tokens
Cache hits & refreshes - $1.00 /M tokens
Output - $50.00 /M tokens

claude-fable-5

claude-opus-4.8

AnthropicText

NEW

Claude Opus 4.8 is Anthropic's most powerful model, offering exceptional reasoning capabilities, deep knowledge understanding, and nuanced language processing. Designed for the most demanding tasks requiring complex analysis, creative writing, and sophisticated problem-solving.

Pricing

Input - $5.00 /M tokens
Cache writes(5m) - $6.25 /M tokens
Cache writes(1h) - $10.00 /M tokens
Cache hits & refreshes - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-4-8

claude-opus-4.7

AnthropicText

NEW

Claude Opus 4.7 is Anthropic's most powerful model, offering exceptional reasoning capabilities, deep knowledge understanding, and nuanced language processing. Designed for the most demanding tasks requiring complex analysis, creative writing, and sophisticated problem-solving.

Pricing

Input - $5.00 /M tokens
Cache writes(5m) - $6.25 /M tokens
Cache writes(1h) - $10.00 /M tokens
Cache hits & refreshes - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-4-7

claude-sonnet-4.5

AnthropicText

Claude Sonnet 4.5 is Anthropic's latest flagship model, achieving world-leading performance in AI agents, programming, and computer usage. Features enhanced knowledge base and exceptional long-text processing capabilities. Particularly suitable for complex long-term tasks, code review, and technical documentation. Excels in accuracy and attention to detail.

Pricing

Input - $3.00 /M tokens
Cache writes(5m) - $3.75 /M tokens
Cache writes(1h) - $6.00 /M tokens
Cache hits & refreshes - $0.30 /M tokens
Output - $15.00 /M tokens

claude-sonnet-4-5

claude-sonnet-4.6

AnthropicText

NEW

Claude Sonnet 4.6 is Anthropic's latest flagship model, achieving world-leading performance in AI agents, programming, and computer usage. Features enhanced knowledge base and exceptional long-text processing capabilities. Particularly suitable for complex long-term tasks, code review, and technical documentation. Excels in accuracy and attention to detail.

Pricing

Input - $3.00 /M tokens
Cache writes(5m) - $3.75 /M tokens
Cache writes(1h) - $6.00 /M tokens
Cache hits & refreshes - $0.30 /M tokens
Output - $15.00 /M tokens

claude-sonnet-4-6

claude-haiku-4.5

AnthropicText

NEW

Claude Haiku 4.5 is a fast, affordable, and highly capable AI model excelling at programming and agentic tasks. Perfectly combines speed with low cost, ideal for real-time applications, large-scale deployments, and scenarios requiring rapid responses. Provides industry-leading cost-effectiveness while maintaining high-quality output.

Pricing

Input - $1.00 /M tokens
Cache writes(5m) - $1.25 /M tokens
Cache writes(1h) - $2.00 /M tokens
Cache hits & refreshes - $0.10 /M tokens
Output - $5.00 /M tokens

claude-haiku-4-5

claude-opus-4.5

AnthropicText

Claude Opus 4.5 is Anthropic's most powerful model, offering exceptional reasoning capabilities, deep knowledge understanding, and nuanced language processing. Designed for the most demanding tasks requiring complex analysis, creative writing, and sophisticated problem-solving.

Pricing

Input - $5.00 /M tokens
Cache writes(5m) - $6.25 /M tokens
Cache writes(1h) - $10.00 /M tokens
Cache hits & refreshes - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-4-5

claude-opus-4.1

AnthropicText

Claude Opus 4.1 delivers premium intelligence with advanced reasoning and analysis capabilities. Excels at complex research tasks, detailed content creation, and nuanced decision-making requiring deep contextual understanding.

Pricing

Input - $15.00 /M tokens
Cache writes(5m) - $18.75 /M tokens
Cache writes(1h) - $30.00 /M tokens
Cache hits & refreshes - $1.50 /M tokens
Output - $75.00 /M tokens

claude-opus-4-1

claude-sonnet-4

AnthropicText

Claude Sonnet 4 provides an excellent balance of intelligence and speed. Features strong coding capabilities, nuanced understanding, and efficient processing for everyday tasks and complex workflows alike.

Pricing

Input - $3.00 /M tokens
Cache writes(5m) - $3.75 /M tokens
Cache writes(1h) - $6.00 /M tokens
Cache hits & refreshes - $0.30 /M tokens
Output - $15.00 /M tokens

claude-sonnet-4

claude-opus-4

AnthropicText

Claude Opus 4 is a fast, affordable, and highly capable AI model excelling at programming and agentic tasks. Perfectly combines speed with low cost, ideal for real-time applications, large-scale deployments, and scenarios requiring rapid responses. Provides industry-leading cost-effectiveness while maintaining high-quality output.

Pricing

Input - $1.00 /M tokens
Cache writes(5m) - $1.25 /M tokens
Cache writes(1h) - $2.00 /M tokens
Cache hits & refreshes - $0.10 /M tokens
Output - $5.00 /M tokens

claude-opus-4

claude-haiku-3.5

AnthropicText

Claude Haiku 3.5 offers impressive speed and cost-efficiency while maintaining strong capabilities. Perfect for high-volume applications, customer service, and tasks requiring quick responses with reliable quality.

Pricing

Input - $0.80 /M tokens
Cache writes(5m) - $1.00 /M tokens
Cache writes(1h) - $1.60 /M tokens
Cache hits & refreshes - $0.08 /M tokens
Output - $4.00 /M tokens

claude-haiku-3-5

claude-haiku-3

AnthropicText

Claude Haiku 3 is the most affordable Claude model, providing fast and efficient responses for simple tasks. Ideal for high-volume, cost-sensitive applications where speed is prioritized over complexity.

Pricing

Input - $0.25 /M tokens
Cache writes(5m) - $0.30 /M tokens
Cache writes(1h) - $0.50 /M tokens
Cache hits & refreshes - $0.03 /M tokens
Output - $1.25 /M tokens

claude-haiku-3

claude-opus-4.7-thinking

AnthropicText

NEW

Claude Opus 4.6 extended thinking model with visible reasoning chains, ideal for deep analytical and complex problem-solving tasks.

Pricing

Input - $5.00 /M tokens
Output - $25.00 /M tokens

claude-opus-4-7-thinking

claude-sonnet-4.6-thinking

AnthropicText

NEW

Claude Sonnet 4.6 extended thinking variant with transparent reasoning output, combining Sonnet-class speed with deep inferential ability.

Pricing

Base Input - $15.00 /M tokens
5m Cache Writes - $18.75 /M tokens
1h Cache Writes - $30.00 /M tokens
Cache Hits - $1.50 /M tokens
Output - $75.00 /M tokens

claude-sonnet-4-6-thinking

claude-opus-4.1-20250805-thinking

AnthropicText

Pinned Claude Opus 4.1 extended thinking snapshot dated 2025/08/05 for stable, versioned production deployments requiring deep reasoning.

Pricing

Base Input - $15.00 /M tokens
5m Cache Writes - $18.75 /M tokens
1h Cache Writes - $30.00 /M tokens
Cache Hits - $1.50 /M tokens
Output - $75.00 /M tokens

claude-opus-4-1-20250805-thinking

claude-opus-4-20250514-thinking

AnthropicText

Pinned Claude Opus 4 extended thinking snapshot dated 2025/05/14, offering deep reasoning with a stable model version.

Pricing

Base Input - $15.00 /M tokens
5m Cache Writes - $18.75 /M tokens
1h Cache Writes - $30.00 /M tokens
Cache Hits - $1.50 /M tokens
Output - $75.00 /M tokens

claude-opus-4-20250514-thinking

claude-opus-4.1-20250805

AnthropicText

Pinned Claude Opus 4.1 snapshot dated 2025/08/05 for workloads requiring a stable, versioned Opus release.

Pricing

Base Input - $5.00 /M tokens
5m Cache Writes - $6.25 /M tokens
1h Cache Writes - $10.00 /M tokens
Cache Hits - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-4-1-20250805

claude-opus-4-20250514

AnthropicText

Pinned Claude Opus 4 snapshot dated 2025/05/14 for stable, versioned production deployments.

Pricing

Base Input - $15.00 /M tokens
5m Cache Writes - $18.75 /M tokens
1h Cache Writes - $30.00 /M tokens
Cache Hits - $1.50 /M tokens
Output - $75.00 /M tokens

claude-opus-4-20250514

claude-opus-4.5-20251101

AnthropicText

Pinned Claude Opus 4.5 snapshot dated 2025/11/01 for stable production deployments.

Pricing

Base Input - $5.00 /M tokens
5m Cache Writes - $6.25 /M tokens
1h Cache Writes - $10.00 /M tokens
Cache Hits - $0.50 /M tokens
Output - $25.00 /M tokens

claude-opus-4-5-20251101

claude-sonnet-4.5-20250929

AnthropicText

Pinned Claude Sonnet 4.5 snapshot dated 2025/09/29 for stable, versioned production deployments.

Pricing

Base Input - $3.00 /M tokens
5m Cache Writes - $3.75 /M tokens
1h Cache Writes - $6.00 /M tokens
Cache Hits - $0.30 /M tokens
Output - $15.00 /M tokens

claude-sonnet-4-5-20250929

claude-sonnet-4-20250514

AnthropicText

Pinned Claude Sonnet 4 snapshot dated 2025/05/14 for stable, versioned production deployments.

Pricing

Base Input - $3.00 /M tokens
5m Cache Writes - $3.75 /M tokens
1h Cache Writes - $6.00 /M tokens
Cache Hits - $0.30 /M tokens
Output - $15.00 /M tokens

claude-sonnet-4-20250514

claude-haiku-4.5-20251001

AnthropicText

Pinned Claude Haiku 4.5 snapshot dated 2025/10/01 for stable, versioned production deployments.

Pricing

Base Input - $0.80 /M tokens
5m Cache Writes - $1.00 /M tokens
1h Cache Writes - $1.60 /M tokens
Cache Hits - $0.08 /M tokens
Output - $4.00 /M tokens

claude-haiku-4-5-20251001

claude-3.5-haiku-20241022

AnthropicText

Pinned Claude 3.5 Haiku snapshot dated 2024/10/22, offering fast and affordable performance for high-volume tasks.

Pricing

Base Input - $3.00 /M tokens
5m Cache Writes - $3.75 /M tokens
1h Cache Writes - $6.00 /M tokens
Cache Hits - $0.30 /M tokens
Output - $15.00 /M tokens

claude-3-5-haiku-20241022

gpt-5.6-sol

OpenAIText

NEW

OpenAI GPT-5.6-sol is the flagship model of the gpt-5.6 family, designed for the most demanding reasoning and agent workloads. Adds the new reasoning.mode=pro, reasoning.effort=max, prompt_cache_options, and programmatic tool calling capabilities.

Pricing

Short context Input - $5.00 /M tokens
Short context Cached input - $0.50 /M tokens
Short context Cache writes - $6.25 /M tokens
Short context Output - $30.00 /M tokens
Long context Input - $10.00 /M tokens
Long context Cached input - $1.00 /M tokens
Long context Cache writes - $12.50 /M tokens
Long context Output - $45.00 /M tokens

gpt-5.6-sol

gpt-5.6-terra

OpenAIText

NEW

OpenAI GPT-5.6-terra balances capability and cost for mainstream chat and reasoning workloads across the gpt-5.6 family.

Pricing

Short context Input - $2.50 /M tokens
Short context Cached input - $0.25 /M tokens
Short context Cache writes - $3.125 /M tokens
Short context Output - $15.00 /M tokens
Long context Input - $5.00 /M tokens
Long context Cached input - $0.50 /M tokens
Long context Cache writes - $6.25 /M tokens
Long context Output - $22.50 /M tokens

gpt-5.6-terra

gpt-5.6-luna

OpenAIText

NEW

OpenAI GPT-5.6-luna is the low-cost, high-throughput member of the gpt-5.6 family, optimized for lightweight chat and reasoning workloads at scale.

Pricing

Short context Input - $1.00 /M tokens
Short context Cached input - $0.10 /M tokens
Short context Cache writes - $1.25 /M tokens
Short context Output - $6.00 /M tokens
Long context Input - $2.00 /M tokens
Long context Cached input - $0.20 /M tokens
Long context Cache writes - $2.50 /M tokens
Long context Output - $9.00 /M tokens

gpt-5.6-luna

gpt-4.1

OpenAIText

OpenAI GPT-4.1 model with enhanced capabilities and improved performance.

Pricing

Input - $2.00 /M tokens
Cached input - $0.50 /M tokens
Output - $8.00 /M tokens

gpt-4.1

gpt-5

OpenAIText

HOT

OpenAI's latest flagship model GPT-5, achieving cross-domain breakthroughs in programming, reasoning, and AI agent tasks. Features stronger understanding capabilities, more accurate reasoning processes, and more natural interaction experiences. Supports complex multi-step task planning and execution, representing the current highest level of large language models.

Pricing

Input - $1.25 /M tokens
Cached input - $0.125 /M tokens
Output - $10.00 /M tokens

gpt-5

gpt-5-mini

OpenAIText

NEW

OpenAI GPT-5-mini is a lower-cost variant optimized for lightweight text workloads and high-throughput applications.

Pricing

Input - $0.25 /M tokens
Cached input - $0.025 /M tokens
Output - $2.00 /M tokens

gpt-5-mini

gpt-5.1

OpenAIText

OpenAI GPT-5.1 model with enhanced capabilities and improved performance over GPT-5.

Pricing

Input - $1.25 /M tokens
Cached input - $0.125 /M tokens
Output - $10.00 /M tokens

gpt-5.1

gpt-5.2

OpenAIText

OpenAI GPT-5.2 model with enhanced capabilities.

Pricing

Input - $1.75 /M tokens
Cached input - $0.175 /M tokens
Output - $14.00 /M tokens

gpt-5.2

gpt-5.3-chat

OpenAIText

NEW

OpenAI GPT-5.3-chat is a chat-optimized GPT-5.3 variant for conversational and assistant-style applications.

Pricing

Input - $1.75 /M tokens
Cached input - $0.175 /M tokens
Output - $14.00 /M tokens

gpt-5.3-chat

gpt-5-3-chat

OpenAIText

NEW

OpenAI GPT-5-3-chat model variant for conversational and assistant-style applications.

Pricing

Input - $1.75 /M tokens
Cached input - $0.175 /M tokens
Output - $14.00 /M tokens

gpt-5-3-chat

gpt-5.3-codex

OpenAIText

OpenAI GPT-5.3-codex model optimized for coding tasks.

Pricing

Input - $1.75 /M tokens
Cached input - $0.175 /M tokens
Output - $14.00 /M tokens

gpt-5.3-codex

gpt-5.4

OpenAIText

OpenAI GPT-5.4 model with enhanced capabilities and improved performance.

Pricing

Input - $2.50 /M tokens
Cached input - $0.25 /M tokens
Output - $15.00 /M tokens

270K 以上上下文长度的标准处理费率详情

gpt-5.4

gpt-5.4-mini

OpenAIText

OpenAI GPT-5.4-mini model balancing quality and efficiency for cost-sensitive chat workloads.

Pricing

Input - $0.75 /M tokens
Cached input - $0.075 /M tokens
Output - $4.50 /M tokens

270K 以上上下文长度的标准处理费率详情

gpt-5.4-mini

gpt-5.4-nano

OpenAIText

OpenAI GPT-5.4-nano ultra-low-cost model for lightweight, high-throughput chat scenarios.

Pricing

Input - $0.20 /M tokens
Cached input - $0.02 /M tokens
Output - $1.25 /M tokens

270K 以上上下文长度的标准处理费率详情

gpt-5.4-nano

gpt-4o

OpenAIText

OpenAI GPT-4o model optimized for multimodal tasks with improved speed and efficiency.

Pricing

Input - $2.50 /M tokens
Cached input - $1.25 /M tokens
Output - $10.00 /M tokens

gpt-4o

gpt-5.5

OpenAIText

NEW

OpenAI GPT-5.5 model for premium long-context and advanced reasoning workloads.

Pricing

Input - $5.00 /M tokens
Cached input - $0.50 /M tokens
Output - $30.00 /M tokens

270K 以上上下文长度的标准处理费率详情

gpt-5.5

FlashVSR

FlashVSRVideo

FlashVSR is a fast, high-quality video upscaler that boosts resolution and restores clarity for low-resolution or blurry footage. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

Pricing

4K - $0.16 /5s
2K - $0.12 /5s
1080p - $0.09 /5s
720p - $0.06 /5s

flashvsr

Kimi-k3

MoonshotText

NEW

Kimi K3 is Moonshot AI's flagship model for long-horizon coding and end-to-end knowledge work, with a 1M-token (1,048,576) context window and industry-leading intelligence. It always reasons and supports configuring its reasoning level via the top-level reasoning_effort request field (low / high / max, default max).

Pricing

Input (cache hit) - $0.30 /M tokens
Input (cache miss) - $3.00 /M tokens
Output - $15.00 /M tokens

kimi-k3

Kimi-k2.7-code

MoonshotText

NEW

Kimi-k2.7-code is Moonshot AI's most capable coding model, delivering reliable instruction-following over long contexts and high task success rates on programming and agentic workloads. Supports text, image, and video input with native chain-of-thought (CoT) output via the reasoning_content field. This endpoint also supports kimi-k2.5, a multimodal model offering both thinking and non-thinking modes.

Pricing

Input (cache hit) - $0.19 /M tokens
Input (cache miss) - $0.95 /M tokens
Output - $4.00 /M tokens

kimi-k2.7-code

hy3-preview

TencentText

NEW

hy3-preview is Tencent's Hunyuan 3 preview model, served through an OpenAI-compatible chat completions interface. It delivers strong general reasoning and instruction-following across long contexts, and can emit its chain-of-thought via the reasoning_content field alongside the final content.

Pricing

Input - $0.167 /M tokens (0~16K)
Output - $0.556 /M tokens (0~16K)
Cache hit - $0.056 /M tokens (0~16K)
Input - $0.222 /M tokens (16K~32K)
Output - $0.889 /M tokens (16K~32K)
Cache hit - $0.083 /M tokens (16K~32K)
Input - $0.278 /M tokens (32K~256K)
Output - $1.111 /M tokens (32K~256K)
Cache hit - $0.111 /M tokens (32K~256K)

hy3-preview

Sonilo

SoniloAudio

NEW

Sonilo audio model for music and sound effect generation from text or video inputs, plus automatic audio ducking. Billed per second of output audio.

Pricing

$0.009 / second (Video → Music)
$0.00225 / second (Text → Music)
$0.009 / second (Video → SFX)
$0.0018 / second (Text → SFX)
$0.0006 / second (Audio Ducking)

sonilo

Menu

One Unified API