Parameters
| Name | Description | Type | Required | Enums |
|---|---|---|---|---|
| prompt | Video generation prompt. | string | Yes | - |
| aspect_ratio | Video aspect ratio. | string | No | 16:9, 9:16, 1:1 |
| duration | Video duration in seconds. Public API accepts integer values. Valid values are 3-15 when videos is not supplied. |
integer | No | 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 |
| images | Optional reference image list. | string[] | No | - |
| mode | Generation mode. | string | No | std, pro |
| multi_shot | Enable intelligent multi-shot generation. | boolean | No | true, false |
| shot_type | Required when multi_shot is true. Only intelligence is supported in the public API. |
string | No | intelligence |
| sound | Audio generation switch. Must remain off whenever videos is provided. |
string | No | on, off |
| videos | Optional reference video list. When provided, sound must be off and duration must be omitted. |
string[] | No | - |
Pricing
Unit: $/sec
| Dimension | Pricing |
|---|---|
| mode: std / sound: on | 0.1500 |
| mode: pro / sound: on | 0.1900 |
| mode: std / sound: off | 0.1700 |
| mode: pro / sound: off | 0.2300 |
Related Models
- kling/kling-v3-t2i: Kling V3 text-to-image model with improved prompt adherence and 1K/2K output support for higher-fidelity creative generation.
- kling/kling-v3-t2v: Kling V3 text-to-video model using the simplified public single-shot interface with 3-15 second output duration.
- kling/kling-v3-i2i: Kling V3 image-to-image model for higher-fidelity editing and restyling with 1K/2K output support.
- kling/kling-v3-omni-image: Kling V3 Omni image model supporting single-image and series generation with flattened image references, element references, and up to 4K output.
- kling/kling-v3-i2v: Kling V3 image-to-video model supporting prompt-driven animation from a single reference image with 3-15 second output duration. The public API supports
image,prompt,negative_prompt,sound,cfg_scale,mode,aspect_ratio, andduration.




