
Video generation models as world simulators
🎬 AI Video
We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of high-fidelity video. Our results suggest that scaling video generation models is a promising path towards building general-purpose simulators of the physical world.
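To make the phrase "spacetime patches" concrete, here is a minimal sketch of how a latent video might be cut into patch tokens for a transformer. It is an illustration only, not Sora's actual implementation: the function name `to_spacetime_patches`, the `(T, H, W, C)` layout, and the patch sizes are all assumptions chosen for the example.

```python
import numpy as np

def to_spacetime_patches(latent, pt, ph, pw):
    """Split a latent video of shape (T, H, W, C) into flattened spacetime patches.

    Hypothetical helper for illustration. Returns an array of shape
    (num_patches, pt * ph * pw * C), one row per patch token.
    """
    T, H, W, C = latent.shape
    if T % pt or H % ph or W % pw:
        raise ValueError("latent dimensions must be divisible by the patch size")
    # Factor each axis into (number of patches, extent within a patch).
    x = latent.reshape(T // pt, pt, H // ph, ph, W // pw, pw, C)
    # Move the patch-grid axes to the front and the within-patch axes to the back.
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)
    # Flatten the grid into a token sequence and each patch into a vector.
    return x.reshape(-1, pt * ph * pw * C)

rng = np.random.default_rng(0)

# A longer, wider clip (assumed latent sizes).
latent = rng.normal(size=(8, 16, 24, 4))
tokens = to_spacetime_patches(latent, pt=2, ph=4, pw=4)
print(tokens.shape)  # (96, 128)

# A shorter, smaller clip yields fewer tokens of the same width.
latent_small = rng.normal(size=(4, 8, 8, 4))
tokens_small = to_spacetime_patches(latent_small, pt=2, ph=4, pw=4)
print(tokens_small.shape)  # (8, 128)
```

Under these assumptions, inputs of different durations, resolutions and aspect ratios differ only in how many tokens they produce, which is one way a single transformer could be trained on all of them.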
Related Tools

Pictory - AI Video Generator
Pictory's powerful AI enables you to create and edit professional quality videos using text, no...

Descript – AI Video & Podcast Editor | Free, Online
Descript makes editing video and audio as easy as editing text. Record, transcribe, edit, and publis...

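蔓藤AI - Intelligent Creation Platform
蔓藤AI provides AI services such as digital human creation, voice cloning, video face swapping, and intelligent copywriting, making content creation simpler and more efficient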

Synthesia: #1 AI Video Platform for Business
Create AI generated videos from text with the most advanced AI avatars and voiceovers in 160+ langua...