
Automated Movie Generation via Multi-Agent CoT Planning

March 10, 2025
Authors: Weijia Wu, Zeyu Zhu, Mike Zheng Shou
cs.AI

Abstract

Existing long-form video generation frameworks lack automated planning: they require manual input for storylines, scenes, cinematography, and character interactions, which makes production costly and inefficient. To address these challenges, we present MovieAgent, a framework for automated movie generation via multi-agent Chain of Thought (CoT) planning. MovieAgent offers two key advantages: 1) We are the first to explore and define the paradigm of automated movie/long-video generation. Given a script and a character bank, MovieAgent can generate multi-scene, multi-shot long-form videos with a coherent narrative, while ensuring character consistency, synchronized subtitles, and stable audio throughout the film. 2) MovieAgent introduces a hierarchical CoT-based reasoning process that automatically structures scenes, camera settings, and cinematography, significantly reducing human effort. By employing multiple LLM agents to simulate the roles of a director, screenwriter, storyboard artist, and location manager, MovieAgent streamlines the production pipeline. Experiments demonstrate that MovieAgent achieves new state-of-the-art results in script faithfulness, character consistency, and narrative coherence. Our hierarchical framework takes a step forward and provides new insights into fully automated movie generation. The code and project website are available at: https://github.com/showlab/MovieAgent and https://weijiawu.github.io/MovieAgent.
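The hierarchical planning described in the abstract (script, then scenes, then shots, with a character bank consulted at each shot) can be pictured with a minimal sketch. This is not the MovieAgent code or API: every name below (`Scene`, `Shot`, `plan_movie`, and the two stub agents) is hypothetical, and the LLM agents are replaced by plain string-splitting functions so the example runs without any model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical agent type: takes text, returns a list of finer-grained items.
# In a real system each agent would be an LLM call; that is assumed, not shown.
Agent = Callable[[str], List[str]]


@dataclass
class Shot:
    description: str
    characters: List[str]  # names from the character bank that appear in the shot


@dataclass
class Scene:
    summary: str
    shots: List[Shot] = field(default_factory=list)


def plan_movie(
    script: str,
    character_bank: Dict[str, str],
    scene_agent: Agent,
    shot_agent: Agent,
) -> List[Scene]:
    """Two-level plan: the scene agent splits the script, the shot agent splits each scene."""
    scenes: List[Scene] = []
    for summary in scene_agent(script):
        scene = Scene(summary)
        for description in shot_agent(summary):
            # Naive substring match stands in for character grounding.
            present = [name for name in character_bank if name in description]
            scene.shots.append(Shot(description, present))
        scenes.append(scene)
    return scenes


# Stub agents (assumptions for illustration only, no LLM involved).
def split_on_blank_lines(script: str) -> List[str]:
    return [part.strip() for part in script.split("\n\n") if part.strip()]


def split_on_sentences(summary: str) -> List[str]:
    return [s.strip() for s in summary.split(".") if s.strip()]


if __name__ == "__main__":
    script = "Alice enters the cafe. Bob waves.\n\nAlice leaves."
    bank = {"Alice": "alice.png", "Bob": "bob.png"}  # hypothetical reference images
    for scene in plan_movie(script, bank, split_on_blank_lines, split_on_sentences):
        print(scene.summary, [(s.description, s.characters) for s in scene.shots])
```

Passing the agents in as callables keeps the two planning levels independent, so either stub could be swapped for a model-backed function without changing `plan_movie`.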

