Communication-efficient edge AI: Algorithms and systems
Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields,
ranging from speech processing, image classification to drug discovery. This is driven by the …
ranging from speech processing, image classification to drug discovery. This is driven by the …
Automatic speech recognition: Systematic literature review
S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org
A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …
years. In particular, there has been increasing interest in the automatic speech recognition …
Robustness of musical features on deep learning models for music genre classification
Music information retrieval (MIR) has witnessed rapid advances in various tasks like musical
similarity, music genre classification (MGC), etc. MGC and audio tagging are approached …
similarity, music genre classification (MGC), etc. MGC and audio tagging are approached …
[PDF][PDF] A Multi-Scale Fusion Framework for Bimodal Speech Emotion Recognition.
M Chen, X Zhao - Interspeech, 2020 - isca-archive.org
Speech emotion recognition (SER) is a challenging task that requires to learn suitable
features for achieving good performance. The development of deep learning techniques …
features for achieving good performance. The development of deep learning techniques …
A decentralized parallel algorithm for training generative adversarial nets
Abstract Generative Adversarial Networks (GANs) are a powerful class of generative models
in the deep learning community. Current practice on large-scale GAN training utilizes large …
in the deep learning community. Current practice on large-scale GAN training utilizes large …
[PDF][PDF] Speech recognition utilizing deep learning: A systematic review of the latest developments
Speech recognition is a natural language processing task that involves the computerized
transcription of spoken language in real time. Numerous studies have been conducted on …
transcription of spoken language in real time. Numerous studies have been conducted on …
Federated acoustic modeling for automatic speech recognition
Data privacy and protection is a crucial issue for any automatic speech recognition (ASR)
service provider when dealing with clients. In this paper, we investigate federated acoustic …
service provider when dealing with clients. In this paper, we investigate federated acoustic …
A(DP)SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent With Differential Privacy
As deep learning models are usually massive and complex, distributed learning is essential
for increasing training efficiency. Moreover, in many real-world application scenarios like …
for increasing training efficiency. Moreover, in many real-world application scenarios like …
PFNN-2: A domain decomposed penalty-free neural network method for solving partial differential equations
A new penalty-free neural network method, PFNN-2, is presented for solving partial
differential equations, which is a subsequent improvement of our previously proposed PFNN …
differential equations, which is a subsequent improvement of our previously proposed PFNN …
Computer vision based on a modular neural network for automatic assessment of physical therapy rehabilitation activities
JA Francisco, PS Rodrigues - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
Physical rehabilitation techniques during the treatment of clinical pathology are one of the
most challenging areas for the medical structure, patients, and families. In large and …
most challenging areas for the medical structure, patients, and families. In large and …