Bring Your Vision to Cinematic Reality with Wanx AI

The groundbreaking video generation platform that turns ideas into stunning, high-definition videos in seconds.

Alibaba's Text-to-Visual Magic Machine

Alibaba Cloud's Wanx 2.1 is a next-gen AI marvel that turns words into stunning visuals. This powerhouse from the Tongyi family transforms text prompts into eye-catching imagery, showcasing Alibaba's impressive leap forward in creative AI technology.

Wanx 2.1

Compatible with Consumer GPUs

The Wan2.1-T2V-1.3B model needs just 8.19 GB of VRAM to generate a 5-second 480P video, putting it within reach of most consumer-grade GPUs. Its performance rivals some closed-source models.

Versatile Capabilities

Wan excels in various tasks: Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, pushing the boundaries of video generation.

High-Performance Video VAE

Wan's video VAE encodes and decodes 1080P videos of any length while maintaining temporal consistency, making it a powerful foundation for video and image generation.

Advanced AI for Creative Content

Experience next-generation AI that transforms simple text into cinematic visuals while understanding context and offering unprecedented creative control.

Quality Output, Zero Complexity

Wanx AI’s advanced diffusion models generate 1080p videos with cinematic lighting, lifelike physics, and fluid motion – all from a simple text prompt.

Smart Context Understanding

Wanx AI doesn’t just process words: its multimodal engine analyzes emotional tone, cultural context, and physical realism.

Total Creative Control

Go beyond basic prompts with features such as style fusion, motion control, and asset swapping.

Wan 2.1 Models

There are several models available, each designed for different capabilities and hardware requirements, ranging from high-performance video generation to efficient processing on consumer-grade GPUs.

Wan2.1-I2V-14B

• Available in 720P and 480P resolutions.
• Outperforms both open-source and closed-source models, achieving state-of-the-art (SOTA) performance.
• Capable of generating videos with intricate visual details and dynamic motion based on text and image inputs.

Get API Key

Wan2.1-T2V-14B

• Supports 480P and 720P video generation.
• Establishes a new SOTA benchmark among both open-source and closed-source models.
• Excels in producing high-quality visuals with advanced motion dynamics.
• The only model that supports generating videos with both Chinese and English text.


Wan2.1-T2V-1.3B

• Designed for consumer-grade GPUs, requiring only 8.19 GB of VRAM for a 5-second 480P video.
• Runs efficiently, generating a 5-second 480P video in about 4 minutes on an RTX 4090.
• Through pre-training and distillation, it surpasses larger open-source models and even rivals some advanced closed-source solutions.
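The three variants above can be summarized as a small lookup table. The following is a minimal sketch that records only what this page states: the `WanModel` class and `candidates` helper are hypothetical names introduced for illustration, and VRAM figures the page does not give are left as `None` rather than guessed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class WanModel:
    name: str
    task: str                     # "T2V" (text-to-video) or "I2V" (image-to-video)
    resolutions: tuple            # resolutions mentioned on this page
    min_vram_gb: Optional[float]  # None where this page states no figure


# Only facts stated on this page; for the 1.3B model only 480P is mentioned.
MODELS = [
    WanModel("Wan2.1-I2V-14B", "I2V", ("480P", "720P"), None),
    WanModel("Wan2.1-T2V-14B", "T2V", ("480P", "720P"), None),
    WanModel("Wan2.1-T2V-1.3B", "T2V", ("480P",), 8.19),
]


def candidates(task: str, resolution: str, vram_gb: float) -> list:
    """List models matching the task and resolution.

    Models with a stated VRAM figure are kept only if the GPU meets it;
    models without one are kept but flagged, since fit cannot be confirmed.
    """
    result = []
    for m in MODELS:
        if m.task != task or resolution not in m.resolutions:
            continue
        if m.min_vram_gb is None:
            result.append((m.name, "vram requirement not stated"))
        elif vram_gb >= m.min_vram_gb:
            result.append((m.name, "fits"))
    return result


if __name__ == "__main__":
    # A 12 GB consumer GPU asking for 480P text-to-video.
    for name, status in candidates("T2V", "480P", 12.0):
        print(f"{name}: {status}")
```

For a GPU below 8.19 GB, the 1.3B model drops out of the list and only the unverified 14B entry remains, which signals that the hardware requirement needs to be checked elsewhere.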


Why Choose the AI/ML API Solution for AI Search?

AI/ML API provides scalability, faster deployment, and access to 200+ advanced machine learning models without the need for extensive in-house expertise or infrastructure.


Easy To Use

Our API allows seamless integration of powerful AI capabilities into your applications, regardless of your coding experience. Simply swap your API key to begin using the AI/ML API.
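As a rough illustration of what key-based integration could look like, here is a minimal sketch that assembles an authenticated request without sending it. Because the Wanx 2.1 API has not been released, everything specific here is an assumption: the `BASE_URL` placeholder, the `MODEL_ID` string, the payload field names, the `AIML_API_KEY` environment variable, and the `build_video_request` helper are all hypothetical.

```python
import json
import os

# Hypothetical placeholders: the real endpoint, model id, and payload
# fields are not published on this page.
BASE_URL = "https://api.example.com/v1"
MODEL_ID = "wan2.1-t2v-14b"


def build_video_request(prompt: str, api_key: str, resolution: str = "720P") -> dict:
    """Assemble the URL, headers, and JSON body for a text-to-video request.

    Nothing is sent over the network; the caller passes the result to
    an HTTP client of their choice.
    """
    if not api_key:
        raise ValueError("an API key is required")
    payload = {"model": MODEL_ID, "prompt": prompt, "resolution": resolution}
    return {
        "url": f"{BASE_URL}/video/generations",
        "headers": {
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        "body": json.dumps(payload),
    }


if __name__ == "__main__":
    # AIML_API_KEY is an assumed variable name, not a documented one.
    key = os.environ.get("AIML_API_KEY", "demo-key")
    request = build_video_request("A red kite over a harbor at dusk", key)
    print(request["url"])
```

Keeping the key in an environment variable rather than in source code means swapping keys requires no code change.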


Scalable

AI/ML API grows with your business: you can scale resources by purchasing more tokens as needed, ensuring optimal performance and cost efficiency.


Affordable

We offer flat, predictable pricing, payable by card or cryptocurrency. We keep it the lowest on the market and affordable for everyone.


Wanx 2.1 API Coming Soon

Book a call with our sales team to be among the first to use the API for Wanx 2.1's advanced large-scale video generation models.

Ready to get started? Get Your API Key Now!
