Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

September 13, 2024
Authors: Ye Bai, Haonan Chen, Jitong Chen, Zhuo Chen, Yi Deng, Xiaohong Dong, Lamtharn Hantrakul, Weituo Hao, Qingqing Huang, Zhongyi Huang, Dongya Jia, Feihu La, Duc Le, Bochen Li, Chumin Li, Hui Li, Xingxing Li, Shouda Liu, Wei-Tsung Lu, Yiqing Lu, Andrew Shaw, Janne Spijkervet, Yakun Sun, Bo Wang, Ju-Chiang Wang, Yuping Wang, Yuxuan Wang, Ling Xu, Yifeng Yang, Chao Yao, Shuo Zhang, Yang Zhang, Yilin Zhang, Hang Zhao, Ziyi Zhao, Dejian Zhong, Shicen Zhou, Pei Zou
cs.AI

Abstract

We introduce Seed-Music, a suite of music generation systems capable of producing high-quality music with fine-grained style control. Our unified framework leverages both auto-regressive language modeling and diffusion approaches to support two key music creation workflows: controlled music generation and post-production editing. For controlled music generation, our system enables vocal music generation with performance controls from multi-modal inputs, including style descriptions, audio references, musical scores, and voice prompts. For post-production editing, it offers interactive tools for editing lyrics and vocal melodies directly in the generated audio. We encourage readers to listen to demo audio examples at https://team.doubao.com/seed-music .

