AudioX: generating audio and music from referenced text, images, and video
General Introduction AudioX is an open source project on GitHub by Zeyue Tian et al. The official paper is published on arXiv (No. 2503.10522). It is based on the diffusion transformer (Diffusion Transf...
































































































