Wan 2.1: Advanced video model excelling in generative tasks.
Wan 2.1, developed by Alibaba's Wan AI team, is a state-of-the-art video foundation model designed for advanced generative video tasks. Supporting Text-to-Video (T2V), it incorporates groundbreaking innovations to deliver high-quality outputs with exceptional computational efficiency.
Wan 2.1 is designed for applications in:
The model can generate text within videos in multiple languages, including Chinese and English.
Wan 2.1 is built on the diffusion transformer paradigm with several innovative features:
Wan 2.1 achieves an 84.7% VBench score, excelling in dynamic scenes, spatial consistency, and aesthetics. It generates 1080p video at 30 FPS with realistic motion, thanks to its space-time attention mechanism. As a leading open-source video generation model, it rivals proprietary alternatives like Sora, though those proprietary models may still outperform it in certain areas.
The model is available on the AI/ML API platform as "Wan 2.1".
Detailed API Documentation is available here.
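As a rough illustration of what a text-to-video request could look like, the sketch below builds (but does not send) a JSON POST request using only the Python standard library. The endpoint URL, the Bearer-token authentication, and the "model" and "prompt" field names are assumptions made for illustration, not the platform's confirmed API. The model string is simply the display name quoted above and may differ from the actual API identifier. Consult the API documentation for the real values.

```python
import json
import urllib.request

# Hypothetical placeholders: take the real endpoint and model identifier
# from the API documentation before using this.
API_URL = "https://api.example.com/v1/video/generations"
MODEL_NAME = "Wan 2.1"  # display name from the text; the API identifier may differ


def build_t2v_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for a text-to-video job."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    # Assumed request schema: a model name plus a free-text prompt.
    body = json.dumps({"model": MODEL_NAME, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_t2v_request(
        "A red kite drifting over a snowy harbor at dawn", "YOUR_API_KEY"
    )
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Keeping request construction separate from sending makes the payload easy to inspect or adjust once the real schema is known. Passing the returned object to urllib.request.urlopen would submit it.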
Alibaba encourages responsible, ethical use of Wan 2.1 in content creation and discourages misuse such as generating deepfakes or inappropriate content.
Wan 2.1 is licensed under Apache 2.0, allowing both commercial and research use with transparent terms.