StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
deep-learning
pytorch
adversarial-training
diffusion-models
gan
latent-diffusion
latent-diffusion-models
speaker-adaptation
speech-synthesis
text-to-speech
tts
wavlm
Updated 2024-03-07 04:49:30 +00:00