PPTAgent:生成並評估超越文本到幻燈片的演示文稿

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

January 7, 2025
作者: Hao Zheng, Xinyan Guan, Hao Kong, Jia Zheng, Hongyu Lin, Yaojie Lu, Ben He, Xianpei Han, Le Sun
cs.AI

摘要

從文件自動生成簡報是一項具有挑戰性的任務,需要平衡內容質量、視覺設計和結構連貫。現有方法主要集中於改進和評估內容質量,往往忽略視覺設計和結構連貫,這限制了它們的實際應用性。為了解決這些限制,我們提出了PPTAgent,通過受人類工作流程啟發的兩階段基於編輯的方法全面改進簡報生成。PPTAgent首先分析參考簡報以了解其結構模式和內容架構,然後通過代碼操作起草大綱並生成幻燈片,以確保一致性和對齊。為了全面評估生成簡報的質量,我們進一步引入了PPTEval,一個評估框架,評估簡報的三個維度:內容、設計和連貫。實驗表明,PPTAgent在所有三個維度上明顯優於傳統的自動簡報生成方法。代碼和數據可在https://github.com/icip-cas/PPTAgent找到。
English
Automatically generating presentations from documents is a challenging task that requires balancing content quality, visual design, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, often overlooking visual design and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to understand their structural patterns and content schemas, then drafts outlines and generates slides through code actions to ensure consistency and alignment. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Experiments show that PPTAgent significantly outperforms traditional automatic presentation generation methods across all three dimensions. The code and data are available at https://github.com/icip-cas/PPTAgent.

Summary

AI-Generated Summary

PDF183January 8, 2025