Open-Sora 2.0 - Open-source video AI with low training costs

The Singapore-based AI startup HPC-AI Tech has introduced a video AI model with lower training costs compared to other similar models. The startup called the new model Open-Sora 2.0.

Image with the writing Open-Sora
Image: Generated with AI

The details

  • According to the research paper, Open-Sora 2.0 needs about $200,000 for training costs. This is 5-10 times less than comparable models like Movie Gen or Step-Video-T2V.
  • HPC-AI Tech used an affordable training pipeline with three main stages: (1) training a text-to-video model using low-resolution video data, (2) training an image-to-video model with low-resolution video data, and (3) fine-tuning an image-to-video model on high-resolution videos. The developers also saved computing resources by using pre-trained image models like Flux.
  • Even though it has lower training costs, Open-Sora 2.0 performs just as well as leading video generation models like HunyuanVideo and Runway Gen-3 Alpha. In addition, Open-Sora 2.0 nearly matches the performance of OpenAI’s Sora.
  • Open-Sora 2.0 can generate videos from text and images at resolutions up to 768×768 pixels for videos up to 5 seconds. The model is available on Hugging Face.

Our thoughts

This paper shows that high-quality video generation models can be developed with controlled costs through an optimized training strategy and pre-trained models.

However, Open-Sora has a lower resolution compared to OpenAI’s Sora, and its maximum length of five seconds is quite short. In comparison, OpenAI’s Sora can generate videos up to 60 seconds with a resolution of up to 1080p.

In addition, current diffusion models often cause unexpected problems, such as distorted objects and unnatural physical effects, which require further research.

More information: 🔗 GitHub | OpenSora Gallery | arXiv


Explore our premium blog articles ✨ Read without banner ads? Become a member or log in

Magic AI tool

Have you ever dreamed of turning your YouTube videos or podcasts into new content, e.g., social media posts? Then, CastMagic is right for you! It is a powerful tool for video creators and podcasters. It supports many languages (e.g., English, German, French, and many more).

In addition, you can also use it for meetings. Imagine you have a sales call. Then, you need a summary of the call, including the customer’s name, asked questions, next steps, and so on. Right? No problem! CastMagic can do all of this for you.

👉🏽 Try it for FREE today!*

Hand-picked articles


😀 Do you enjoy our content? If so, why not support us with a small financial contribution? As a supporter, you can comment on newsletter editions (e-mail version) and read our website without banner ads.

AI and Coding Merch ✨ Read without banner ads? Become a member or log in