Abstract: This study proposes an innovative speech translation method based on Pix2PixGAN, which maps the Mel spectrograms of speech produced by deaf individuals to those of normal-hearing individuals ...
Abstract: A new neural network architecture is proposed that can be used to convert Mel spectrograms into an audio signal. The architecture is designed from the ground up to be run on a mobile device, ...
A major direction of deep learning in audio, especially in generative models, is to use frequency-domain features, because directly modeling the raw time-domain signal is hard. But this requires an extra process to ...