Style-based drum synthesis with GAN inversion: International Society of Music Information Retrieval Conference 2021

Jake Drysdale, Maciej Tomczak, Jason Hockman

Research output: Contribution to conferencePaper

Abstract

Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that generate audio from hand-designed components, such as oscillators and wavetables. For a neural audio synthesizer to be applicable to music creation, meaningful control over the output is essential. This paper provides an overview of an unsupervised approach to deriving useful feature controls learned by a generative model. A system for generation and transformation of drum samples using a style-based generative adversarial network (GAN) is proposed. The system provides functional control of audio style features, based on principal component analysis (PCA) applied to the intermediate latent space. Additionally, we propose the use of an encoder trained to invert input drums back to the latent space of the pre-trained GAN. We experiment with three modes of control and provide audio results on a supporting website.
Original languageEnglish
Publication statusPublished (VoR) - 8 Nov 2021

Fingerprint

Dive into the research topics of 'Style-based drum synthesis with GAN inversion: International Society of Music Information Retrieval Conference 2021'. Together they form a unique fingerprint.

Cite this