[PDF][PDF] Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Scaling speech technology to 1,000+ languages

V Pratap, A Tjandra, B Shi, P Tomasello, A Babu… - Journal of Machine …, 2024 - jmlr.org
Expanding the language coverage of speech technology has the potential to improve
access to information for many more people. However, current speech technology is …

Unsupervised cross-lingual representation learning for speech recognition

A Conneau, A Baevski, R Collobert… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper presents XLSR which learns cross-lingual speech representations by pretraining
a single model from the raw waveform of speech in multiple languages. We build on …

A survey on deep learning for big data

Q Zhang, LT Yang, Z Chen, P Li - Information Fusion, 2018 - Elsevier
Deep learning, as one of the most currently remarkable machine learning techniques, has
achieved great success in many applications such as image analysis, speech recognition …

Deepchain: Auditable and privacy-preserving deep learning with blockchain-based incentive

J Weng, J Weng, J Zhang, M Li… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Deep learning can achieve higher accuracy than traditional machine learning algorithms in
a variety of machine learning tasks. Recently, privacy-preserving deep learning has drawn …

Latent backdoor attacks on deep neural networks

Y Yao, H Li, H Zheng, BY Zhao - Proceedings of the 2019 ACM SIGSAC …, 2019 - dl.acm.org
Recent work proposed the concept of backdoor attacks on deep neural networks (DNNs),
where misclassification rules are hidden inside normal models, only to be triggered by very …

{TensorFlow}: a system for {Large-Scale} machine learning

M Abadi, P Barham, J Chen, Z Chen, A Davis… - … USENIX symposium on …, 2016 - usenix.org
TensorFlow is a machine learning system that operates at large scale and in heterogeneous
environments. Tensor-Flow uses dataflow graphs to represent computation, shared state …

Tensorflow: Large-scale machine learning on heterogeneous distributed systems

M Abadi, A Agarwal, P Barham, E Brevdo… - arXiv preprint arXiv …, 2016 - arxiv.org
TensorFlow is an interface for expressing machine learning algorithms, and an
implementation for executing such algorithms. A computation expressed using TensorFlow …

Speechstew: Simply mix all available speech recognition data to train one large neural network

W Chan, D Park, C Lee, Y Zhang, Q Le… - arXiv preprint arXiv …, 2021 - arxiv.org
We present SpeechStew, a speech recognition model that is trained on a combination of
various publicly available speech recognition datasets: AMI, Broadcast News, Common …