Learning Bandwidth Expansion Using Perceptually-motivated Loss

Author(s): Feng, Berthy; Jin, Zeyu; Su, Jiaqi; Finkelstein, Adam

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1dn9s

Abstract:	We introduce a perceptually motivated approach to bandwidth expansion for speech. Our method pairs a new 3-way split variant of the FFTNet neural vocoder structure with a perceptual loss function, combining objectives from both the time and frequency domains. Mean opinion score tests show that it outperforms baseline methods from both domains, even for extreme bandwidth expansion.
Publication Date:	2019
Citation:	Feng, Berthy, Zeyu Jin, Jiaqi Su, and Adam Finkelstein. "Learning Bandwidth Expansion Using Perceptually-motivated Loss." IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2019): pp. 606-610. doi:10.1109/ICASSP.2019.8682367
DOI:	10.1109/ICASSP.2019.8682367
ISSN:	1520-6149
EISSN:	2379-190X
Pages:	606 - 610
Type of Material:	Conference Article
Journal/Proceeding Title:	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Version:	Author's manuscript