The wan 2.6 video model by Alibaba delivers high-fidelity cinematic output with superior temporal consistency. Grounded in a Causal Diffusion Transformer, it excels at complex physics and precise motion control for professional video production.
The wan 2.6 video model uses a Causal Diffusion Transformer (T2V-Turbo) architecture. This technical approach prevents objects from morphing or shifting their identity mid-clip, ensuring that a 10-second video maintains the same character and environment details from the first frame to the last.
What is the max resolution for wan video output?
Native generation for wan is 720p, which can be upscaled to high-fidelity 1080p. The model supports various aspect ratios including 16:9, 9:16, and 1:1, making it versatile for both cinematic film production and social media content like TikTok or Instagram Reels.
Can I use wan for Image to Video (I2V) tasks?
Yes, wan 2.6 video offers robust Image to Video (I2V) features. It provides strict adherence to the reference image, ensuring the generated motion starts exactly from the provided pixels. This is ideal for animating game assets or maintaining character consistency across different shots.
How long does a wan 2.6 video generation take?
Generation typically takes 20 to 30 seconds of processing time for every one second of video produced. A standard 5-second wan clip usually finishes within 2 to 3 minutes, depending on the chosen quality tier and current server load on the GPTProto platform.
Is my data used to train the wan model?
Data privacy is a priority at GPTProto. Prompts and videos generated using the wan 2.6 video API are not used for model training. We offer an opt-out policy for all enterprise traffic to ensure your creative assets and proprietary prompts remain private and secure.
How does wan 2.6 video pricing work?
Standard 720p quality is priced at $0.08 per second, while Pro 1080p quality is $0.15 per second. Image to Video tasks are billed at the same rate based on the output duration. We also offer a 20% discount for batch generations of over 50 concurrent clips.