PPTAgent:生成和评估超越文本到幻灯片的演示文稿

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

January 7, 2025
作者: Hao Zheng, Xinyan Guan, Hao Kong, Jia Zheng, Hongyu Lin, Yaojie Lu, Ben He, Xianpei Han, Le Sun
cs.AI

摘要

从文档自动生成演示文稿是一项具有挑战性的任务,需要平衡内容质量、视觉设计和结构连贯性。现有方法主要集中在改进和评估内容质量,往往忽视视觉设计和结构连贯性,从而限制了它们的实际适用性。为了解决这些局限性,我们提出了PPTAgent,通过受人类工作流程启发的两阶段基于编辑的方法全面改进演示文稿生成。PPTAgent首先分析参考演示文稿以理解其结构模式和内容模式,然后通过代码操作起草大纲并生成幻灯片,以确保一致性和对齐。为了全面评估生成演示文稿的质量,我们进一步引入了PPTEval,一个评估框架,评估演示文稿的内容、设计和连贯性三个维度。实验证明,PPTAgent在所有三个维度上显著优于传统的自动演示文稿生成方法。代码和数据可在https://github.com/icip-cas/PPTAgent获取。
English
Automatically generating presentations from documents is a challenging task that requires balancing content quality, visual design, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, often overlooking visual design and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to understand their structural patterns and content schemas, then drafts outlines and generates slides through code actions to ensure consistency and alignment. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Experiments show that PPTAgent significantly outperforms traditional automatic presentation generation methods across all three dimensions. The code and data are available at https://github.com/icip-cas/PPTAgent.

Summary

AI-Generated Summary

PDF183January 8, 2025