種子音樂：一個統一框架，用於高質量和可控音樂生成。

摘要

我們介紹Seed-Music，這是一套能夠產生高品質音樂並具有精細風格控制的音樂生成系統。我們的統一框架結合自回歸語言建模和擴散方法，支持兩種關鍵音樂創作工作流程：受控音樂生成和後期製作編輯。對於受控音樂生成，我們的系統能夠從多模態輸入中獲取表演控制，包括風格描述、音頻參考、樂譜和語音提示，實現聲樂音樂生成。對於後期製作編輯，它提供了互動工具，可直接編輯生成音頻中的歌詞和聲樂旋律。我們鼓勵讀者在https://team.doubao.com/seed-music 聆聽示範音頻範例。

English

We introduce Seed-Music, a suite of music generation systems capable of producing high-quality music with fine-grained style control. Our unified framework leverages both auto-regressive language modeling and diffusion approaches to support two key music creation workflows: controlled music generation and post-production editing. For controlled music generation, our system enables vocal music generation with performance controls from multi-modal inputs, including style descriptions, audio references, musical scores, and voice prompts. For post-production editing, it offers interactive tools for editing lyrics and vocal melodies directly in the generated audio. We encourage readers to listen to demo audio examples at https://team.doubao.com/seed-music .

種子音樂：一個統一框架，用於高質量和可控音樂生成。

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

摘要

Summary

Support

Support