IMAGE MODEL Apr 2025 Kuaishou Last updated: Apr 29, 2026

Kling 2.0

Kuaishou's Text-to-Video Flagship

Kuaishou's (Chinese short-video platform, TikTok competitor) text-to-video flagship. Originally Kling 1.0 launched June 2024 before Sora was publicly available — briefly making it the only production-accessible cinema-quality text-to-video product anywhere. Kling 2.0 (April 2025) extended duration to 3 minutes and added image-to-video plus multi-shot coherence.

Try demo

Official ↗

Why it matters

Kling represents a structural shift: for the first time in a major generative AI category, the first-mover advantage went to a Chinese platform rather than a US lab. The category (text-to-video) may permanently bifurcate into US-flagship-quality and Chinese-platform-scale segments.

Core Capabilities

Generative

Produces images, video, audio, or other media.

Vision

Understands images, scenes, and visual context.

Multimodal

Combines text, vision, and audio in one model.

Context Window

Context window not disclosed.

Availability

API

Available

Product / App

Available

Open Source

Not released

Enterprise

Contact sales

Pricing Model

Subscription

Bundled inside the host product.

Subscription

Capability / Performance

Where this model sits relative to the middle 60% of models in the tree. All scores are 0–10 (higher is better).

Lower 20% Upper 80% This model

Quality

No data reported · placeholder

5.0

Speed

No data reported · placeholder

5.0

Control

No data reported · placeholder

5.0

Consistency

No data reported · placeholder

5.0

Lower 20% 20th percentile — 20% of models score below this This model Where the current model lands Upper 80% 80th percentile — only 20% of models score above this Percentile boundaries are computed across every model in the tree that reports the underlying benchmark for each capability.

What it feels like

AI filmmakers consistently rate Kling among the top 2 image-to-video tools, behind only Veo 3
Curious Refuge: 'Best image-to-video tool in 2025' with motion fluidity 'beats everyone except maybe Veo 3'
2.5 Turbo blind-test win rates 285% vs Seedance 1.0 mini, 212% vs Veo 3 Fast, 160% vs Seedance 1.0
2.6 (Dec 2025) added simultaneous audio-visual generation — semantic alignment of voice rhythm + visuals
5-second 1080p video at 25 credits in 2.5 Turbo — ~30% cheaper than 2.1
Strong character consistency and physical motion — preferred for cinematic-style storyboarding