Acoustic features modelling for statistical parametric speech synthesis: a review

Z Mu, X Yang, Y Dong - arXiv preprint arXiv:2104.09995, 2021 - arxiv.org

As an indispensable part of modern human-computer interaction system, speech synthesis
technology helps users get the output of intelligent machine more easily and intuitively, thus …

被引用次数：34 相关文章所有 2 个版本

Advance research in agricultural text-to-speech: the word segmentation of analytic language and the deep learning-based end-to-end system

X Li, D Ma, B Yin - Computers and Electronics in Agriculture, 2021 - Elsevier

Abstract Agricultural Text-to-Speech (TTS) has attracted increasingly more attention. The
application of agricultural TTS and its problems are analyzed in this paper, and the …

被引用次数：20 相关文章所有 4 个版本

Ask a further question or give a list? how should conversational agents reply to users' uncertain queries

Q Ma, Y Zhang, W Xu, R Zhou - International Journal of Human …, 2024 - Taylor & Francis

Conversational agents (CAs) have recently become ubiquitous. Smart speakers, mobile
phone voice assistants, and in-car voice assistants have entered our lives. Studies have …

被引用次数：5 相关文章所有 2 个版本

[HTML] mdpi.com

[HTML][HTML] A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning

S Oyucu - Electronics, 2023 - mdpi.com

Text-to-Speech (TTS) systems have made strides but creating natural-sounding human
voices remains challenging. Existing methods rely on noncomprehensive models with only …

被引用次数：4 相关文章所有 3 个版本

[PDF] researchgate.net

Rationally or emotionally: how should voice user interfaces reply to users of different genders considering user experience?

Q Ma, R Zhou, C Zhang, Z Chen - Cognition, Technology & Work, 2022 - Springer

Voice user interfaces (VUIs) have exploded in popularity over the past 3 years. However,
there has been little research on the reply methods that VUIs can adopt to communicate with …

被引用次数：6 相关文章所有 5 个版本

[HTML] springer.com Full View

[HTML][HTML] A multi-task learning speech synthesis optimization method based on CWT: a case study of Tacotron2

G Hu, Z Ruan, W Guo, Y Quan - EURASIP Journal on Advances in Signal …, 2024 - Springer

Text-to-speech synthesis plays an essential role in facilitating human-computer interaction.
Currently, the predominant approach in Text-to-speech acoustic models selects only the Mel …

被引用次数：1 相关文章所有 10 个版本

[PDF] wiley.com Full View

Study about Chinese speech synthesis algorithm and acoustic model based on wireless communication network

L Shi, M Li, Y Su, Y Chen - Wireless Communications and …, 2021 - Wiley Online Library

Chinese speech synthesis refers to the technology that machines transform human speech
signals into corresponding texts or commands through recognition and understanding. This …

被引用次数：2 相关文章所有 4 个版本

Review of time–frequency masking approach for improving speech intelligibility in noise

G Kim - IETE Technical Review, 2022 - Taylor & Francis

Over the last decade, time–frequency masking techniques have been explored to achieve
substantial improvement of speech intelligibility in noise. Binary or soft mask can be applied …

被引用次数：3 相关文章所有 3 个版本

How to design the expression ways of conversational agents based on affective experience

C Zhang, R Zhou, Y Zhang, Y Sun, L Zou… - … , HCI 2020, Held as Part of …, 2020 - Springer

With the rapid development of artificial intelligence, the technology of human-computer
interaction is becoming more and more mature. The variety of terminal products equipped …

被引用次数：4 相关文章所有 2 个版本

[PDF] wiley.com Full View

[Retracted] The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting

J Yang - Journal of Control Science and Engineering, 2022 - Wiley Online Library

To improve the sound quality of speech synthesis technology in intelligent broadcasting, a
deep neural network‐based method is proposed. It also proved the effectiveness of the DNN …

被引用次数：1 相关文章所有 6 个版本