Kling O1
The most reference-capable model in the Kling family. Kling O1 combines reasoning-enhanced generation with support for multiple image references and video references, giving you closer control over the look and motion of the result.
What is Kling O1?
Kling O1 is Kuaishou's most reference-capable video model. It combines the reasoning-enhanced generation of the O series with an expanded input format: multiple image references and video references can be provided together in a single generation request.
The multi-image reference capability allows you to supply several still images simultaneously — subject references, environment shots, lighting examples — giving the model rich visual context before it begins generation. The video reference capability goes a step further, allowing you to provide an existing video clip as a visual and motion reference alongside your images.
The combination of reasoning, multi-image reference, and video reference makes Kling O1 the strongest option in the Kling family for directing a generation from reference material. It's designed for use cases where visual consistency and reference fidelity matter most, and where the complexity of the brief exceeds what a text prompt alone can communicate.
Multi-image reference: multiple images per generation
Video reference: existing video as visual guide
Reasoning-enhanced: complex prompt understanding
1080p output: Full HD cinematic quality
How to generate video with Kling O1 on project.video
1. Open the composer
Navigate to your project.video dashboard and open the composer. Kling O1 is available in the model selector under the Kling family.
2. Select Kling O1
Choose Kling O1 when reference fidelity and reasoning are your priorities. The composer shows all available input slots for this model.
3. Upload references
Add multiple image references, such as subject, environment, and lighting. Add a video reference clip if you have one. These inputs guide the model's generation alongside your text prompt.
4. Write a detailed prompt and generate
Write a rich, detailed prompt. Kling O1's reasoning processes the full context (your text plus all reference inputs) before generating the output.
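This page describes the composer UI only and does not document an API. Purely to illustrate how the inputs from the steps above fit together, here is a minimal Python sketch. Every name in it (`GenerationRequest`, `image_refs`, `video_ref`, the `"kling-o1"` identifier) is hypothetical and not part of project.video or Kling.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class GenerationRequest:
    """Hypothetical container mirroring the composer's input slots."""

    prompt: str
    image_refs: List[str] = field(default_factory=list)  # still-image references
    video_ref: Optional[str] = None  # optional reference clip
    model: str = "kling-o1"  # assumed identifier, for illustration only

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the request looks complete."""
        problems = []
        if not self.prompt.strip():
            problems.append("prompt is empty")
        if not self.image_refs and self.video_ref is None:
            problems.append("no image or video references supplied")
        return problems


# Example inputs: two image references, one video reference, one text prompt.
request = GenerationRequest(
    prompt="Slow pan across a glass-walled house at dusk, 16:9",
    image_refs=["materials.jpg", "environment.jpg"],
    video_ref="pan_reference.mp4",
)
print(request.validate())
```

The check for missing references reflects the guidance above: Kling O1 is the model to pick when references drive the result, so a request with text alone is flagged rather than rejected outright.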
Technical specs
Best use cases
Reference-heavy creative briefs
When a client has extensive visual brand guidelines, provide multiple reference images — approved visual styles, product shots, environment examples — and let Kling O1 synthesise them into a generation that meets the brief.
Continuation and stylistic matching
Provide an existing video clip as a reference to match the visual style, motion character, and aesthetic of an established piece. Kling O1 can generate new content that feels like it belongs in the same series.
Complex character-driven scenarios
When a generation requires consistent character appearance across multiple reference angles plus a specific motion style from a video reference, Kling O1's combined input capability handles this level of complexity.
Premium production pre-visualization
For film and advertising pre-vis, supply mood boards, environment references, and motion references simultaneously. Kling O1's reasoning processes the full creative context into coherent, intentional output.
Example prompts
These prompts benefit from Kling O1's multi-reference capability — pair them with image and video references in the composer.
"A premium whiskey glass on a dark oak bar, candlelight reflecting in the liquid, same visual atmosphere as provided environment reference images, matching shot style from reference video, 16:9"
"Fashion editorial in a sun-bleached desert landscape, model in flowing white dress, consistent with provided look reference images, camera movement matching reference clip, 9:16"
"Architectural visualisation of a glass-walled house at dusk, interior lighting visible through walls, matching provided material reference images and environment mood, slow pan, 16:9"