Video generation

By Jianchao Ci, CEO & CTO · 6 min read · 2026-04-06

TL;DR

Compare AI video models by input type, duration, features, and cost per second.

Text-to-video vs. image-to-video

Text-to-video models generate motion from a prompt alone. Image-to-video models animate from a starting frame, giving you direct control over the first shot.

Use the table to compare duration range, supported inputs, available controls, and cost per generated second.
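As a rough illustration of that comparison, the sketch below estimates the cost of a clip and picks the cheapest model for a given input type. It assumes total cost is simply credits per second multiplied by clip length; the model names, durations, and rates are hypothetical placeholders, not values from the table.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VideoModel:
    name: str
    input_type: str            # "text" (prompt only) or "image" (prompt plus start frame)
    max_duration_s: int        # longest clip the model can generate, in seconds
    credits_per_second: float  # price per generated second


# Hypothetical rows for illustration only; not real models or prices.
MODELS = [
    VideoModel("example-t2v", "text", 10, 4.0),
    VideoModel("example-i2v", "image", 8, 6.0),
    VideoModel("example-i2v-lite", "image", 5, 3.0),
]


def clip_cost(model: VideoModel, duration_s: int) -> float:
    """Estimated credits for one clip, assuming cost = rate * seconds."""
    if duration_s <= 0 or duration_s > model.max_duration_s:
        raise ValueError(f"{model.name} cannot generate a {duration_s}s clip")
    return model.credits_per_second * duration_s


def cheapest(models: list[VideoModel], input_type: str, duration_s: int) -> Optional[VideoModel]:
    """Cheapest model that accepts this input type and supports the duration."""
    candidates = [
        m for m in models
        if m.input_type == input_type and duration_s <= m.max_duration_s
    ]
    return min(candidates, key=lambda m: clip_cost(m, duration_s), default=None)


if __name__ == "__main__":
    pick = cheapest(MODELS, "image", 5)
    if pick is not None:
        print(pick.name, clip_cost(pick, 5))
```

Actual billing may include minimums or rounding that this sketch ignores, so treat the result as an estimate.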

How to read the table

  • Input tells you whether the model starts from a prompt alone or from a prompt plus a start frame.
  • Output summarizes duration, resolution, and framing controls.
  • Features call out start-frame input, end-frame guidance, audio, and other workflow-changing options.
  • Price is shown in credits per generated second.
  • [Generation cost](/docs/generation-cost)
  • [AI video generation guide](/blog/ai-video-generator-guide)
  • [AI image-to-video guide](/docs/ai-image-to-video-guide)
