Review of end-to-end speech synthesis technology based on deep learning

Z Mu, X Yang, Y Dong - arXiv preprint arXiv:2104.09995, 2021 - arxiv.org
As an indispensable part of modern human-computer interaction system, speech synthesis
technology helps users get the output of intelligent machine more easily and intuitively, thus …

Advance research in agricultural text-to-speech: the word segmentation of analytic language and the deep learning-based end-to-end system

X Li, D Ma, B Yin - Computers and Electronics in Agriculture, 2021 - Elsevier
Agricultural Text-to-Speech (TTS) has attracted increasingly more attention. The
application of agricultural TTS and its problems are analyzed in this paper, and the …

Ask a further question or give a list? how should conversational agents reply to users' uncertain queries

Q Ma, Y Zhang, W Xu, R Zhou - International Journal of Human …, 2024 - Taylor & Francis
Conversational agents (CAs) have recently become ubiquitous. Smart speakers, mobile
phone voice assistants, and in-car voice assistants have entered our lives. Studies have …

A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning

S Oyucu - Electronics, 2023 - mdpi.com
Text-to-Speech (TTS) systems have made strides but creating natural-sounding human
voices remains challenging. Existing methods rely on noncomprehensive models with only …

Rationally or emotionally: how should voice user interfaces reply to users of different genders considering user experience?

Q Ma, R Zhou, C Zhang, Z Chen - Cognition, Technology & Work, 2022 - Springer
Voice user interfaces (VUIs) have exploded in popularity over the past 3 years. However,
there has been little research on the reply methods that VUIs can adopt to communicate with …

A multi-task learning speech synthesis optimization method based on CWT: a case study of Tacotron2

G Hu, Z Ruan, W Guo, Y Quan - EURASIP Journal on Advances in Signal …, 2024 - Springer
Text-to-speech synthesis plays an essential role in facilitating human-computer interaction.
Currently, the predominant approach in Text-to-speech acoustic models selects only the Mel …

Study about Chinese speech synthesis algorithm and acoustic model based on wireless communication network

L Shi, M Li, Y Su, Y Chen - Wireless Communications and …, 2021 - Wiley Online Library
Chinese speech synthesis refers to the technology that machines transform human speech
signals into corresponding texts or commands through recognition and understanding. This …

Review of time–frequency masking approach for improving speech intelligibility in noise

G Kim - IETE Technical Review, 2022 - Taylor & Francis
Over the last decade, time–frequency masking techniques have been explored to achieve
substantial improvement of speech intelligibility in noise. Binary or soft mask can be applied …

How to design the expression ways of conversational agents based on affective experience

C Zhang, R Zhou, Y Zhang, Y Sun, L Zou… - … , HCI 2020, Held as Part of …, 2020 - Springer
With the rapid development of artificial intelligence, the technology of human-computer
interaction is becoming more and more mature. The variety of terminal products equipped …

[Retracted] The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting

J Yang - Journal of Control Science and Engineering, 2022 - Wiley Online Library
To improve the sound quality of speech synthesis technology in intelligent broadcasting, a
deep neural network‐based method is proposed. It also proved the effectiveness of the DNN …