PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
text-to-speech
deep-neural-networks
pytorch
tts
speech-synthesis
generative-model
semi-supervised-learning
global-style-tokens
neural-tts
non-autoregressive
parallel-tacotron
non-ar
emotion-transfer
cross-speaker
conditional-layer-normalization
Updated Nov 9, 2022 - Python