Voice in human–agent interaction: A survey

K Seaborn, NP Miyake, P Pennefather… - ACM Computing …, 2021 - dl.acm.org
Social robots, conversational agents, voice assistants, and other embodied AI are
increasingly a feature of everyday life. What connects these various types of intelligent …

The state of speech in HCI: Trends, themes and challenges

L Clark, P Doyle, D Garaialde, E Gilmartin… - Interacting with …, 2019 - academic.oup.com
Speech interfaces are growing in popularity. Through a review of 99 research papers this
work maps the trends, themes, findings and methods of empirical research on speech …

[HTML][HTML] The human takes it all: Humanlike synthesized voices are perceived as less eerie and more likable. evidence from a subjective ratings study

K Kühne, MH Fischer, Y Zhou - Frontiers in neurorobotics, 2020 - frontiersin.org
Background: The increasing involvement of social robots in human lives raises the question
how humans perceive social robots. Little is known about human perception of synthesized …

[PDF][PDF] Speech synthesis evaluation—state-of-the-art assessment and suggestion for a novel research program

P Wagner, J Beskow, S Betz, J Edlund… - Proceedings of the 10th …, 2019 - core.ac.uk
Speech synthesis applications have become an ubiquity, in navigation systems, digital
assistants or as screen or audio book readers. Despite their impact on the acceptability of …

Choice of voices: A large-scale evaluation of text-to-speech voice quality for long-form content

J Cambre, J Colnago, J Maddock, J Tsai… - Proceedings of the 2020 …, 2020 - dl.acm.org
The advancement of text-to-speech (TTS) voices and a rise of commercial TTS platforms
allow people to easily experience TTS voices across a variety of technologies, applications …

Siri, echo and performance: You have to suffer darling

MP Aylett, BR Cowan, L Clark - Extended Abstracts of the 2019 CHI …, 2019 - dl.acm.org
Don't ignore this because its about speech technology. VUIs (voice user interfaces) won a
best paper in CHI 2018. Did that get your attention? Good. Siri, Ivona, Google Home, and …

Evaluating long-form text-to-speech: Comparing the ratings of sentences and paragraphs

R Clark, H Silen, T Kenter, R Leith - arXiv preprint arXiv:1909.03965, 2019 - arxiv.org
Text-to-speech systems are typically evaluated on single sentences. When long-form
content, such as data consisting of full paragraphs or dialogues is considered, evaluating …

Interactive hesitation synthesis: modelling and evaluation

S Betz, B Carlmeyer, P Wagner, B Wrede - Multimodal Technologies and …, 2018 - mdpi.com
Conversational spoken dialogue systems that interact with the user rather than merely
reading the text can be equipped with hesitations to manage dialogue flow and user …

Building and designing expressive speech synthesis

MP Aylett, L Clark, BR Cowan, I Torre - … Agents: 20 years of Research on …, 2021 - dl.acm.org
We know there is something special about speech. Our voices are not just a means of
communicating. They also give a deep impression of who we are and what we might know …

Persuasive synthetic speech: Voice perception and user behaviour

M Dubiel, M Halvey, PO Gallegos, S King - Proceedings of the 2nd …, 2020 - dl.acm.org
Previous research indicates that synthetic speech can be as persuasive as human speech.
However, there is a lack of empirical validation on interactive goal-oriented tasks. In our two …