alibaba/wan2.7-videoedit

wan2.7-videoedit
Docs
Schema

Wan 2.7 Video Editing model supports video style modification and video editing with multi-modal inputs (text/image/video). Processing time: 1-5 minutes.

$0.1000~$0.1400/sec
video-to-video

Input

Negative prompt describing unwanted content
Text prompt describing desired video modifications (supports Chinese and English, up to 5000 characters)
Audio handling mode (auto: intelligent decision based on prompt, origin: preserve original audio)
auto
Video duration in seconds (2-10 = truncate to specified length)
5
Enable intelligent prompt rewriting (improves quality for short prompts but increases processing time)
Video aspect ratio (defaults to input video ratio if not specified)
ratio
Reference image URLs for style transfer (optional, max 4). Combined with video_url, the total media count must be ≤ 5
Hint: Drag and drop files, paste from clipboard (Ctrl/Cmd+V), or provide a URL.
Video resolution tier
1080P
Random seed for reproducibility (note: results may vary even with same seed)
Input video URL (HTTP/HTTPS/OSS) or Base64 encoded data (required)
Hint: Drag and drop files, paste from clipboard (Ctrl/Cmd+V), or provide a URL.

Result

No results yet

Run the model to preview the output here.

README

Parameters

Name Description Type Required Enums
negative_prompt Negative prompt describing unwanted content string No -
prompt Text prompt describing desired video modifications (supports Chinese and English, up to 5000 characters) string No -
audio_setting Audio handling mode (auto: intelligent decision based on prompt, origin: preserve original audio) string No auto, origin
duration Video duration in seconds (2-10 = truncate to specified length) integer No 2, 3, 4, 5, 6, 7, 8, 9, 10
prompt_extend Enable intelligent prompt rewriting (improves quality for short prompts but increases processing time) boolean No true, false
ratio Video aspect ratio (defaults to input video ratio if not specified) string No 16:9, 9:16, 1:1, 4:3, 3:4
reference_images Reference image URLs for style transfer (optional, max 4). Combined with video_url, the total media count must be ≤ 5 string[] No -
resolution Video resolution tier string No 720P, 1080P
seed Random seed for reproducibility (note: results may vary even with same seed) integer No -
video_url Input video URL (HTTP/HTTPS/OSS) or Base64 encoded data (required) string Yes -

Pricing

Unit: $/sec

Dimension Pricing
resolution: 720P 0.1000
resolution: 1080P 0.1400
  • alibaba/wan2.7-image: Wan 2.7 Image is the standard text-to-image model with faster generation speed. Supports up to 2K resolution, thinking mode, sequential generation, and custom color themes. Does not support 4K.
  • alibaba/wan2.7-image-pro: Wan 2.7 Image Pro is the professional text-to-image model supporting up to 4K resolution, thinking mode for enhanced reasoning, sequential multi-image generation, and custom color themes. Supports Chinese and English prompts up to 5000 characters.
  • alibaba/wan2.7-image-edit: Wan 2.7 Image Edit is the standard image editing model with faster generation speed. Supports multi-image reference, interactive editing, sequential generation, and custom color themes. Max 2K output. Does not support 4K.
  • alibaba/wan2.7-image-pro-edit: Wan 2.7 Image Pro Edit is the professional image editing model supporting multi-image reference generation, interactive bounding-box editing, sequential multi-image generation, and custom color themes. Supports 1-9 input images, max 2K output.
  • alibaba/wan2.7-r2v: Wan 2.7 Reference-to-Video model generates videos from reference media (images and videos) with prompts. Requires at least one media parameter (reference_images, reference_videos, or first_frame_image). Supports multi-subject references, storyboard generation, and custom audio for voice cloning.
  • alibaba/wan2.7-t2v: Wan 2.7 text-to-video model generates high-quality videos from text prompts using the new protocol (resolution+ratio instead of size). Supports multi-shot narrative, automatic dubbing, custom audio, 720P/1080P resolutions, and 2–15 second durations.
  • alibaba/wan2.7-i2v: Wan 2.7 image-to-video model supports three task modes with flattened media parameters: first-frame generation, first-last-frame generation, and continuation. Parameter combination must conform to allowed mode combinations.

More in this series