Show simple item record

dc.contributor.advisorGunawardena, Athula
dc.contributor.advisorMukherjee, Lopamudra
dc.contributor.advisorZhou, Jiazhen
dc.contributor.authorPfantz, Levi
dc.date.accessioned2022-01-06T17:26:27Z
dc.date.available2022-01-06T17:26:27Z
dc.date.issued2021-12
dc.identifier.urihttp://digital.library.wisc.edu/1793/82595
dc.descriptionThis file was last viewed in Adobe Acrobat Pro.en_US
dc.description.abstractTraining neural networks require sizeable datasets for meaningful output. It is difficult to acquire large datasets for many types of data. This is especially challenging for individuals and small organizations. We have taken SinGAN [1], a model that works to address those issues in the image domain, and extended it to work in the audio domain. Our new model, called AudioSinGAN, uses deep convolutional generative adversarial networks (DCGAN) trained on a single audio sample to generate new, unique, audio samples. Like SinGAN, AudioSinGAN uses a pyramid of unique GANs, each responsible for learning and generating different levels of detail. Our system is capable of generating unique audio with clear features from the single input audio clip. We explore and discuss the realities of converting and tuning a generative adversarial network (GAN) built for images into one built for audio and our results. We also present a database of audio clips generated by AudioSinGAN and use Singular Value Decomposition to analyze the dataset and confirm that our model successfully generates audio belonging to unique classes. We also learn that a challenge facing our system is audio that contains multiple audio sources overlapping each other. Finally, we discuss methods to address this issue including splitting audio into frequency band before processing.en_US
dc.language.isoen_USen_US
dc.publisherUniversity of Wisconsin - Whitewateren_US
dc.subjectMachine learningen_US
dc.subjectNeural networks (Computer science)en_US
dc.subjectAudio frequencyen_US
dc.titleAudio generation from a single sample using deep convolutional generative adversarial networksen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record