Controlling Emotion in Text-to-Speech with Natural Language Prompts
In recent years, prompting has quickly become one of the standard ways of steering the
outputs of generative machine learning models, due to its intuitive use of natural language …
outputs of generative machine learning models, due to its intuitive use of natural language …
EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech
In the current era of Artificial Intelligence Generated Content (AIGC), a Low-Rank Adaptation
(LoRA) method has emerged. It uses a plugin-based approach to learn new knowledge with …
(LoRA) method has emerged. It uses a plugin-based approach to learn new knowledge with …
[引用][C] FastSpeechStyle: Fast, Emotion Controllable, and High-Quality Speech Synthesis
Non-autoregressive text-to-speech models such as Fastspeech2 can fast synthesize high-
quality speech. This model also allows explicit control of the speech signal's pitch, energy …
quality speech. This model also allows explicit control of the speech signal's pitch, energy …