Queen Mary University of London, Sony CSL – Paris Music Team
Marco Pasini is a PhD student at Queen Mary University of London. In collaboration with Sony CSL Paris, he researches ways to make generative models for audio and music both faster and more controllable.
Fast and Controllable Generative Models for Music
This presentation covers my PhD research, which aims to democratize AI music creation by developing models that are both fast and controllable. I will present novel architectures and training techniques, from efficient consistency autoencoders for audio compression to latent diffusion models for controllable accompaniment, and show how they achieve high-quality waveform music generation with unprecedented speed and user-driven creative direction.