kling/kling-v3-omni-video

kling-v3-omni-video
Docs
Schema

Kling V3 Omni video model for advanced prompt-guided video generation with simplified intelligent multi-shot support and flattened media references.

$0.1500~$0.2300/sec
video-to-video

Input

Video generation prompt.
Video aspect ratio.
aspect_ratio
Video duration in seconds. Public API accepts integer values. Valid values are 3-15 when `videos` is not supplied.
5
Optional reference image list.
Hint: Drag and drop files, paste from clipboard (Ctrl/Cmd+V), or provide a URL.
Generation mode.
pro
Enable intelligent multi-shot generation.
Required when `multi_shot` is true. Only `intelligence` is supported in the public API.
shot_type
Audio generation switch. Must remain `off` whenever `videos` is provided.
off
Optional reference video list. When provided, `sound` must be `off` and `duration` must be omitted.
Hint: Drag and drop files, paste from clipboard (Ctrl/Cmd+V), or provide a URL.

Result

No results yet

Run the model to preview the output here.

README

Parameters

Name Description Type Required Enums
prompt Video generation prompt. string Yes -
aspect_ratio Video aspect ratio. string No 16:9, 9:16, 1:1
duration Video duration in seconds. Public API accepts integer values. Valid values are 3-15 when videos is not supplied. integer No 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
images Optional reference image list. string[] No -
mode Generation mode. string No std, pro
multi_shot Enable intelligent multi-shot generation. boolean No true, false
shot_type Required when multi_shot is true. Only intelligence is supported in the public API. string No intelligence
sound Audio generation switch. Must remain off whenever videos is provided. string No on, off
videos Optional reference video list. When provided, sound must be off and duration must be omitted. string[] No -

Pricing

Unit: $/sec

Dimension Pricing
mode: std / sound: on 0.1500
mode: pro / sound: on 0.1900
mode: std / sound: off 0.1700
mode: pro / sound: off 0.2300
  • kling/kling-v3-t2i: Kling V3 text-to-image model with improved prompt adherence and 1K/2K output support for higher-fidelity creative generation.
  • kling/kling-v3-t2v: Kling V3 text-to-video model using the simplified public single-shot interface with 3-15 second output duration.
  • kling/kling-v3-i2i: Kling V3 image-to-image model for higher-fidelity editing and restyling with 1K/2K output support.
  • kling/kling-v3-omni-image: Kling V3 Omni image model supporting single-image and series generation with flattened image references, element references, and up to 4K output.
  • kling/kling-v3-i2v: Kling V3 image-to-video model supporting prompt-driven animation from a single reference image with 3-15 second output duration. The public API supports image, prompt, negative_prompt, sound, cfg_scale, mode, aspect_ratio, and duration.

More in this series