
Vidu Q3 Pro text-to-video model. Higher quality generation with support for `prompt`, `duration` (1-16s), `aspect_ratio`, `resolution`, and `seed`.

Vidu Q3 Turbo text-to-video model. Fast generation with support for `prompt`, `duration` (1-16s), `aspect_ratio`, `resolution`, and `seed`.
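As a rough illustration of how the parameters listed for the two Vidu Q3 models might be assembled, the sketch below builds a request body as a plain dictionary. The function name `build_vidu_q3_request`, the `model` field, and the identifier `vidu-q3-pro` are assumptions made for this example; only the parameter names and the 1-16 second duration range come from the descriptions above.

```python
from typing import Optional


def build_vidu_q3_request(
    prompt: str,
    duration: int,
    aspect_ratio: Optional[str] = None,
    resolution: Optional[str] = None,
    seed: Optional[int] = None,
    model: str = "vidu-q3-pro",  # hypothetical identifier, not confirmed by the source
) -> dict:
    """Assemble a request body from the documented Vidu Q3 parameters."""
    # The descriptions give a duration range of 1-16 seconds.
    if not 1 <= duration <= 16:
        raise ValueError("duration must be between 1 and 16 seconds")

    body = {"model": model, "prompt": prompt, "duration": duration}

    # Optional fields are included only when the caller supplies them.
    optional = {
        "aspect_ratio": aspect_ratio,
        "resolution": resolution,
        "seed": seed,
    }
    body.update({key: value for key, value in optional.items() if value is not None})
    return body
```

Passing a fixed `seed` is the usual way to ask for repeatable output, although whether identical seeds give identical videos depends on the service.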

Wan 2.7 text-to-video model generates high-quality videos from text prompts. It uses the new request protocol, which specifies resolution and ratio instead of size. Supports multi-shot narrative, automatic dubbing, custom audio, 720P/1080P resolutions, and 2-15 second durations.
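To make the difference between the two protocols concrete, the sketch below converts a legacy size value into a resolution and ratio pair. It assumes the old size was written as `WIDTH*HEIGHT` or `WIDTHxHEIGHT` and that the resolution label follows the shorter side; neither detail is stated in the description, and the function name and the output keys `resolution` and `ratio` are hypothetical.

```python
from math import gcd

# Resolutions named in the description, keyed by the shorter side in pixels.
SUPPORTED_RESOLUTIONS = {720: "720P", 1080: "1080P"}


def size_to_resolution_ratio(size: str) -> dict:
    """Convert an assumed 'WIDTH*HEIGHT' size string to resolution plus ratio."""
    width_text, height_text = size.lower().replace("*", "x").split("x")
    width, height = int(width_text), int(height_text)

    # Assumption: the resolution label is determined by the shorter side.
    short_side = min(width, height)
    if short_side not in SUPPORTED_RESOLUTIONS:
        raise ValueError(f"unsupported resolution for size {size!r}")

    # Reduce width:height to lowest terms, e.g. 1280:720 becomes 16:9.
    divisor = gcd(width, height)
    return {
        "resolution": SUPPORTED_RESOLUTIONS[short_side],
        "ratio": f"{width // divisor}:{height // divisor}",
    }
```

Under these assumptions, `size_to_resolution_ratio("1280*720")` yields a 720P resolution with a 16:9 ratio, and a portrait size such as `720x1280` yields 720P with 9:16.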

Google Veo 3.1 Lite text-to-video model. Supports `prompt`, `negativePrompt`, `aspectRatio`, `durationSeconds`, `resolution` (up to 1080p), and `personGeneration`.
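The Veo 3.1 Lite parameters use camelCase names, unlike the snake_case names listed for the Vidu models. A minimal sketch of a container for them follows; the class name `VeoLiteRequest` and the `to_payload` helper are hypothetical, and the example values in the usage note are placeholders rather than documented defaults.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class VeoLiteRequest:
    """Holds the parameters listed for Veo 3.1 Lite, using the source's field names."""

    prompt: str
    negativePrompt: Optional[str] = None
    aspectRatio: Optional[str] = None
    durationSeconds: Optional[int] = None
    resolution: Optional[str] = None
    personGeneration: Optional[str] = None

    def to_payload(self) -> dict:
        # Drop unset fields so the service can apply its own defaults.
        return {key: value for key, value in asdict(self).items() if value is not None}
```

For example, `VeoLiteRequest(prompt="a lighthouse at dusk", resolution="1080p").to_payload()` produces a dictionary containing only `prompt` and `resolution`.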

HappyHorse text-to-video model generates physically realistic, smoothly animated video from text prompts. It supports various resolution and aspect ratio combinations, with durations of 3-15 seconds.

Kling V3 text-to-video model using the simplified public single-shot interface with 3-15 second output duration.
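Several of the models above state different duration limits, so a caller that switches between them may want to keep the ranges in one place. The sketch below does that and clamps a requested duration into the stated range. The dictionary keys are hypothetical model identifiers chosen for this example; only the numeric ranges are taken from the descriptions.

```python
# Duration ranges in seconds as stated in the descriptions above.
# The keys are hypothetical identifiers, not confirmed model names.
DURATION_RANGES = {
    "vidu-q3-pro": (1, 16),
    "vidu-q3-turbo": (1, 16),
    "wan-2.7": (2, 15),
    "happyhorse": (3, 15),
    "kling-v3": (3, 15),
}


def clamp_duration(model: str, seconds: int) -> int:
    """Return the requested duration limited to the model's stated range."""
    if model not in DURATION_RANGES:
        raise KeyError(f"no duration range recorded for {model!r}")
    shortest, longest = DURATION_RANGES[model]
    return max(shortest, min(seconds, longest))
```

Clamping silently changes the request, so a stricter caller might prefer to raise an error when the value falls outside the range.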

MiniMax T2V-01 delivers professional camera movement control, transforming text prompts into cinematic video clips with dynamic shots.

T2V-01-Director offers precise camera control for creating professional video clips with cinematic movements through a variety of lens instructions.

Hailuo 02 is a text-to-video model with exceptional instruction following and high visual realism, including convincing rendering of extreme physics.