Distributed deep learning strategies for automatic speech recognition

Y Shi, K Yang, T Jiang, J Zhang… - … Surveys & Tutorials, 2020 - ieeexplore.ieee.org

Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields,
ranging from speech processing, image classification to drug discovery. This is driven by the …

被引用次数：375 相关文章所有 8 个版本

[PDF] ieee.org

Automatic speech recognition: Systematic literature review

S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org

A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …

被引用次数：92 相关文章所有 4 个版本

Robustness of musical features on deep learning models for music genre classification

Y Singh, A Biswas - Expert Systems with Applications, 2022 - Elsevier

Music information retrieval (MIR) has witnessed rapid advances in various tasks like musical
similarity, music genre classification (MGC), etc. MGC and audio tagging are approached …

被引用次数：34 相关文章所有 2 个版本

[PDF] isca-archive.org

[PDF][PDF] A Multi-Scale Fusion Framework for Bimodal Speech Emotion Recognition.

M Chen, X Zhao - Interspeech, 2020 - isca-archive.org

Speech emotion recognition (SER) is a challenging task that requires to learn suitable
features for achieving good performance. The development of deep learning techniques …

被引用次数：63 相关文章所有 5 个版本

[PDF] neurips.cc

A decentralized parallel algorithm for training generative adversarial nets

M Liu, W Zhang, Y Mroueh, X Cui… - Advances in …, 2020 - proceedings.neurips.cc

Abstract Generative Adversarial Networks (GANs) are a powerful class of generative models
in the deep learning community. Current practice on large-scale GAN training utilizes large …

被引用次数：78 相关文章所有 12 个版本

[PDF] researchgate.net

[PDF][PDF] Speech recognition utilizing deep learning: A systematic review of the latest developments

D Al-Fraihat, Y Sharrab, F Alzyoud… - Human-centric …, 2024 - researchgate.net

Speech recognition is a natural language processing task that involves the computerized
transcription of spoken language in real time. Numerous studies have been conducted on …

被引用次数：12 相关文章所有 3 个版本

[PDF] arxiv.org

Federated acoustic modeling for automatic speech recognition

X Cui, S Lu, B Kingsbury - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Data privacy and protection is a crucial issue for any automatic speech recognition (ASR)
service provider when dealing with clients. In this paper, we investigate federated acoustic …

被引用次数：45 相关文章所有 4 个版本

[PDF] arxiv.org

A(DP)SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent With Differential Privacy

J Xu, W Zhang, F Wang - IEEE transactions on pattern analysis …, 2021 - ieeexplore.ieee.org

As deep learning models are usually massive and complex, distributed learning is essential
for increasing training efficiency. Moreover, in many real-world application scenarios like …

被引用次数：35 相关文章所有 8 个版本

[PDF] arxiv.org

PFNN-2: A domain decomposed penalty-free neural network method for solving partial differential equations

H Sheng, C Yang - arXiv preprint arXiv:2205.00593, 2022 - arxiv.org

A new penalty-free neural network method, PFNN-2, is presented for solving partial
differential equations, which is a subsequent improvement of our previously proposed PFNN …

被引用次数：18 相关文章所有 4 个版本

[PDF] ieee.org

Computer vision based on a modular neural network for automatic assessment of physical therapy rehabilitation activities

JA Francisco, PS Rodrigues - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org

Physical rehabilitation techniques during the treatment of clinical pathology are one of the
most challenging areas for the medical structure, patients, and families. In large and …

被引用次数：15 相关文章所有 3 个版本