HierSpeech++: Hierarchical Variational Inference for Zero-shot Speech Synthesis
Recent progress in the capabilities of large language models has played a crucial role in advancing LLM-based frameworks for audio generation and speech synthesis, especially in the zero-shot setting. Traditional speech synthesis frameworks …