Category
API
App
Hi~
Enterprise AI Resource Hub
Let AI find the answer for every need
Latest
Popular
Recommended
doubao-seedream-5-0-260128
Seedream 5 is ByteDance's latest multimodal image model.
Pricing:
$0.035/call
qwen3.5-122b-a10b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.12/1M tokensstarting from
Output:
$0.92/1M tokensstarting from
qwen3.5-27b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.09/1M tokensstarting from
Output:
$0.69/1M tokensstarting from
qwen3.5-35b-a3b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.06/1M tokensstarting from
Output:
$0.46/1M tokensstarting from
qwen3.5-flash
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.03/1M tokensstarting from
Output:
$0.29/1M tokensstarting from
gemini-3.1-pro-preview
The most powerful agentic and coding model, with the best multimodal understanding capabilities
Input:
$2/1M tokensstarting from
Output:
$12/1M tokensstarting from
claude-sonnet-4-6-thinking
The latest mid-range model released by Anthropic
Input:
$3/1M tokensstarting from
Output:
$15/1M tokensstarting from
claude-sonnet-4-6
The latest mid-range model released by Anthropic
Input:
$3/1M tokensstarting from
Output:
$15/1M tokensstarting from
qwen3.5-397b-a17b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.12/1M tokensstarting from
Output:
$0.69/1M tokensstarting from
qwen3.5-plus
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.12/1M tokensstarting from
Output:
$0.69/1M tokensstarting from
Doubao-Seed-2.0-Code
ByteDance has launched a large-scale model dedicated to code, focusing on code generation, completion, interpretation, and debugging, with multilingual support and context-aware capabilities.
Input:
$0.46/1M tokensstarting from
Output:
$2.29/1M tokensstarting from
Doubao-Seed-2.0-mini
ByteDance has launched an ultra-lightweight large language model specifically designed for resource-constrained scenarios and large-scale foundational tasks.
Input:
$0.115/1M tokensstarting from
Output:
$0.115/1M tokensstarting from
Doubao-Seed-2.0-lite
ByteDance’s lightweight large language model, suitable for everyday conversations and light-duty tasks that are cost-sensitive and demand high response speeds.
Input:
$0.128/1M tokensstarting from
Output:
$0.78/1M tokensstarting from
Doubao-Seed-2.0-pro
ByteDance’s multimodal large model has achieved significant breakthroughs in code generation, tool invocation, and long-context understanding.
Input:
$0.46/1M tokensstarting from
Output:
$2.28/1M tokensstarting from
MiniMax-M2.5-highspeed
The ultra-fast version of Mininax-M2.5—same performance, yet faster and more agile (output speed approximately 100 tps, compared to 60 tps for M2.5).
Input:
$0.6/1M tokens
Output:
$4.8/1M tokens
MiniMax-M2.5
MiniMax’s text-generation model achieves outstanding performance in complex task scenarios such as programming, tool invocation, search, and office work, with extremely low costs and high inference efficiency.
Input:
$0.3/1M tokens
Output:
$1.2/1M tokens
Pro/zai-org/GLM-5
Zhipu AI has launched a new-generation flagship foundation large model, specifically designed for complex system engineering and long-term agent tasks, offering a real programming experience that closely rivals Claude Opus 4.5.
Input:
$0.572/1M tokensstarting from
Output:
$2.58/1M tokensstarting from
glm-5
Zhipu AI has launched a new-generation flagship foundation large model, specifically designed for complex system engineering and long-term agent tasks, offering a real programming experience that closely rivals Claude Opus 4.5.
Input:
$0.6/1M tokensstarting from
Output:
$2.6/1M tokensstarting from
Jimeng Video Generation 3.0
The video generation model launched by Volcano Engine supports up to 1080P high-definition rendering, making it a cost-effective choice that balances generation quality and speed.
Pricing:
$0.05/second
starting from
Jimeng Video Generation 3.0 pro
Volcano Engine’s flagship video-generation model features multi-camera storytelling capabilities and professional-grade 1080P quality.
Pricing:
$0.16/second
View Details
Try Now
Nano Banana Canvas
Edit images, including annotation and stitching; image processing can also be performed via prompts.
Pricing:
Depends on the specific model used
View Details
Try Now
3D Camera Stuido
Perform multi-angle transformations and background blending on the image.
Pricing:
Depends on the specific model used
View Details
Try Now
Nano Banana MD
Use Nano-Banana to automatically add images to articles.
Pricing:
Depends on the specific model used
View Details
Try Now
Nano Banana PPT
Use Nano-Banana to create a PPT.
Pricing:
Depends on the specific model used
View Details
Try Now
AI Video Generator
Turn text and images to video
Pricing:
Depends on the specific model used
Jimeng Video Generation 3.0
The video generation model launched by Volcano Engine supports up to 1080P high-definition rendering, making it a cost-effective choice that balances generation quality and speed.
Pricing:
$0.05/second
starting from
Jimeng Video Generation 3.0 pro
Volcano Engine’s flagship video-generation model features multi-camera storytelling capabilities and professional-grade 1080P quality.
Pricing:
$0.16/second
Kling O3 video generation
Kuaishou has launched a flagship AI video-generation model that covers four core scenarios: text-to-video, image-to-video, reference-video-based generation, and intelligent video editing.
Pricing:
$0.084/second
starting from
Kling v3(Image-to-video)
Kuaishou's flagship image-to-video model
Pricing:
$0.036/Second
starting from
Kling v3(Text-to-video)
Kuaishou's flagship text-to-video model
Pricing:
$0.036/Second
starting from
qwen3.5-122b-a10b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.12/1M tokensstarting from
Output:
$0.92/1M tokensstarting from
qwen3.5-27b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.09/1M tokensstarting from
Output:
$0.69/1M tokensstarting from
qwen3.5-35b-a3b
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.06/1M tokensstarting from
Output:
$0.46/1M tokensstarting from
qwen3.5-flash
Alibaba’s Qwen3.5 series supports multimodal capabilities.
Input:
$0.03/1M tokensstarting from
Output:
$0.29/1M tokensstarting from
gemini-3.1-pro-preview
The most powerful agentic and coding model, with the best multimodal understanding capabilities
Input:
$2/1M tokensstarting from
Output:
$12/1M tokensstarting from
doubao-seedream-5-0-260128
Seedream 5 is ByteDance's latest multimodal image model.
Pricing:
$0.035/call
Kling Image O3
Kuaishou’s AI image generation model features two core capabilities: Text-to-Image and Image Edit.
Pricing:
$0.028/image
starting from
Kling O3(Text-to-Image)
Kuaishou’s flagship text-to-image model supports ultra-high 4K resolution.
Pricing:
$0.028/image
starting from
Grok-Imagine-Image
xAI’s text-to-image product—a cost-effective, easy-to-integrate image generation interface.
Pricing:
$0.05/call
starting from
Z-Image
The latest image-generation model released by Tongyi Lab
Pricing:
$0.05/call
Hunyuan3d (Multi-Interface Integration
Tencent’s flagship generative AI 3D modeling product that generates high-quality 3D models from text or images.
Pricing:
$0.02/Point
starting from
Remove Background
The image background removal tool from Photoroom
Pricing:
$0.022/call
qwen-image-edit-plus-2025-12-15
The Tongyi Qianwen series Image Editing Plus model further optimizes inference performance and system stability based on the initial Edit model.
Pricing:
$0.03/image
Qwen-Image-Layered
A model that decomposes an image into multiple RGBA layers endows the image with inherent editability.
Pricing:
$0.05/call
wavespeed-ai/image-captioner
High-precision image understanding and description models
Pricing:
$0.001/call
speech-2.8-hd
High-performance text-to-speech model launched by MiniMax
Pricing:
$52.5/1M characters
starting from
speech-2.8-turbo
High-performance text-to-speech model launched by MiniMax
Pricing:
$30/1M characters
starting from
music-2.5
MiniMax’s flagship AI music generation model delivers both high-fidelity sound quality and studio-level control.
Pricing:
$0.15/call
GLM-ASR-2512
Zhipu's next-generation speech recognition model supports real-time conversion of speech into high-quality text.
Pricing:
$0.025/M tokens
GLM-TTS-Clone
A 3-second voice sample clones the speaker’s tone and speech patterns.
Pricing:
$0.9/call
GLM-OCR Layout analysis
The layout analysis model released by Zhipu is used to parse the layout of documents and images and extract text content.
Pricing:
$0.03/M Tokens
DeepSeek-OCR
DeepSeek-OCR from Spohnet
Pricing:
$0.015/call
PaddleOCR-VL
PaddleOCR-VL from Spohnet
Pricing:
$0.002/call
Link-to-Image
Our own services
Pricing:
$0.001/call
MinerU-2.5
MinerU, a PDF parsing tool
Pricing:
$0.001/Page
qwen3-rerank
Text ranking model trained based on the Qwen LLM foundation
Input:
$0.07/1M tokens
Output:
Free
rerank-2.5
Re-ranking model from Voyage AI
Input:
$0.1/1M tokens
Output:
$0.1/1M tokens
voyage-3-large
Embedded model from Voyage AI
Input:
$0.2/1M tokens
Output:
$0.2/1M tokens
rerank-2.5-lite
Re-ranking model from Voyage AI
Input:
$0.05/1M tokens
Output:
$0.05/1M tokens
voyage-context-3
Embedded model from Voyage AI
Input:
$0.2/1M tokens
Output:
$0.2/1M tokens
Pay with 302
Quickly integrate payment functionality into your product
Pricing:
Free
Deploy web pages by one-click
One-click web deployment
Pricing:
$0.001/call
starting from
Anime to Real Person
Explore various creative approaches to anime-to-live-action adaptations
Pricing:
Optimize tokens generated by prompts + image generation API costs
Clothing Flat Lay
Explore various creative ways to use clothing flat lays
Pricing:
Optimize tokens generated by prompts + image generation API costs
3D Doll
Explore various creative ways to play with 3D dolls
Pricing:
Optimize tokens generated by prompts + image generation API costs