The threat of offensive ai to organizations

Y Mirsky, A Demontis, J Kotak, R Shankar, D Gelei… - Computers & …, 2023 - Elsevier
AI has provided us with the ability to automate tasks, extract information from vast amounts of
data, and synthesize media that is nearly indistinguishable from the real thing. However …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Fastspeech: Fast, robust and controllable text to speech

Y Ren, Y Ruan, X Tan, T Qin, S Zhao… - Advances in neural …, 2019 - proceedings.neurips.cc
Neural network based end-to-end text to speech (TTS) has significantly improved the quality
of synthesized speech. Prominent methods (eg, Tacotron 2) usually first generate mel …

Barriers, drivers, and social considerations for AI adoption in supply chain management: a tertiary study

J Hangl, VJ Behrens, S Krause - Logistics, 2022 - mdpi.com
Background: The number of publications in supply chain management (SCM) and artificial
intelligence (AI) has risen significantly in the last two decades, and their quality and …

ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit

T Hayashi, R Yamamoto, K Inoue… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
This paper introduces a new end-to-end text-to-speech (E2E-TTS) toolkit named ESPnet-
TTS, which is an extension of the open-source speech processing toolkit ESPnet. The toolkit …

Towards end-to-end unsupervised speech recognition

AH Liu, WN Hsu, M Auli… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Unsupervised speech recognition has shown great potential to make Automatic Speech
Recognition (ASR) systems accessible to every language. However, existing methods still …

Lrspeech: Extremely low-resource speech synthesis and recognition

J Xu, X Tan, Y Ren, T Qin, J Li, S Zhao… - Proceedings of the 26th …, 2020 - dl.acm.org
Speech synthesis (text to speech, TTS) and recognition (automatic speech recognition, ASR)
are important speech tasks, and require a large amount of text and speech pairs for model …

Mixspeech: Data augmentation for low-resource automatic speech recognition

L Meng, J Xu, X Tan, J Wang, T Qin… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
In this paper, we propose MixSpeech, a simple yet effective data augmentation method
based on mixup for automatic speech recognition (ASR). MixSpeech trains an ASR model …

Semi-supervised neural architecture search

R Luo, X Tan, R Wang, T Qin… - Advances in Neural …, 2020 - proceedings.neurips.cc
Neural architecture search (NAS) relies on a good controller to generate better architectures
or predict the accuracy of given architectures. However, training the controller requires both …

Unsupervised text-to-speech synthesis by unsupervised automatic speech recognition

J Ni, L Wang, H Gao, K Qian, Y Zhang, S Chang… - arXiv preprint arXiv …, 2022 - arxiv.org
An unsupervised text-to-speech synthesis (TTS) system learns to generate speech
waveforms corresponding to any written sentence in a language by observing: 1) a …