Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k

March 12, 2025
Authors: Xiangyu Peng, Zangwei Zheng, Chenhui Shen, Tom Young, Xinying Guo, Binluo Wang, Hang Xu, Hongxin Liu, Mingyan Jiang, Wenjun Li, Yuhui Wang, Anbang Ye, Gang Ren, Qianran Ma, Wanying Liang, Xiang Lian, Xiwen Wu, Yuting Zhong, Zhuangyan Li, Chaoyu Gong, Guojun Lei, Leijun Cheng, Limin Zhang, Minghao Li, Ruijie Zhang, Silan Hu, Shijie Huang, Xiaokang Wang, Yuanheng Zhao, Yuqi Wang, Ziang Wei, Yang You
cs.AI

Abstract

Video generation models have achieved remarkable progress in the past year. The quality of AI video continues to improve, but at the cost of larger model size, increased data quantity, and greater demand for training compute. In this report, we present Open-Sora 2.0, a commercial-level video generation model trained for only $200k. With this model, we demonstrate that the cost of training a top-performing video generation model is highly controllable. We detail all techniques that contribute to this efficiency breakthrough, including data curation, model architecture, training strategy, and system optimization. According to human evaluation results and VBench scores, Open-Sora 2.0 is comparable to global leading video generation models including the open-source HunyuanVideo and the closed-source Runway Gen-3 Alpha. By making Open-Sora 2.0 fully open-source, we aim to democratize access to advanced video generation technology, fostering broader innovation and creativity in content creation. All resources are publicly available at: https://github.com/hpcaitech/Open-Sora.
