HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Author(s): Su, Jiaqi; Jin, Zeyu; Finkelstein, Adam

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1kz73
Full metadata record
DC Field | Value | Language
dc.contributor.author | Su, Jiaqi | -
dc.contributor.author | Jin, Zeyu | -
dc.contributor.author | Finkelstein, Adam | -
dc.date.accessioned | 2021-10-08T19:50:55Z | -
dc.date.available | 2021-10-08T19:50:55Z | -
dc.date.issued | 2020 | en_US
dc.identifier.citation | Su, Jiaqi, Zeyu Jin, and Adam Finkelstein. "HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks." Proc. Interspeech (2020): pp. 4506-4510. doi:10.21437/Interspeech.2020-2143 | en_US
dc.identifier.uri | https://arxiv.org/pdf/2006.05694.pdf | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1kz73 | -
dc.description.abstract | Real-world audio recordings are often degraded by factors such as noise, reverberation, and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to transform recorded speech to sound as though it had been recorded in a studio. We use an end-to-end feed-forward WaveNet architecture, trained with multi-scale adversarial discriminators in both the time domain and the time-frequency domain. It relies on the deep feature matching losses of the discriminators to improve the perceptual quality of enhanced speech. The proposed model generalizes well to new speakers, new speech content, and new environments. It significantly outperforms state-of-the-art baseline methods in both objective and subjective experiments. | en_US
dc.format.extent | 4506 - 4510 | en_US
dc.language.iso | en_US | en_US
dc.relation.ispartof | Proc. Interspeech | en_US
dc.rights | Author's manuscript | en_US
dc.title | HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks | en_US
dc.type | Conference Article | en_US
dc.identifier.doi | 10.21437/Interspeech.2020-2143 | -
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding | en_US

Files in This Item:
File | Description | Size | Format
HiFiGan.pdf | - | 616.03 kB | Adobe PDF


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.