DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Ziqian Ning¹, Huakang Chen¹, Yuepeng Jiang¹, Jixun Yao¹, Chunbo Hao¹, Guobin Ma¹, Shuai Wang², Lei Xie¹
¹ Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University, Xi'an, China
² Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China
1. Abstract
Recent advancements in music generation have garnered significant attention, yet existing approaches face critica...
Read more at aslp-lab.github.io