Intelligent Quran Recitation Recognition and Verification: Research Trends and Open Issues

SS Alrumiah, AA Al-Shargabi - Arabian Journal for Science and …, 2023 - Springer
Muslims aim to recite and memorize the Holy Quran correctly. However, traditional recitation
verification approaches depend on humans who may not be available. On the other hand …

Detection of precursors of combustion instability using convolutional recurrent neural networks

A Cellier, CJ Lapeyre, G Öztarlik, T Poinsot… - Combustion and …, 2021 - Elsevier
Many combustors are prone to Thermoacoustic Instabilities (TAI). Being able to avoid TAI is
mandatory to efficiently operate a system without sacrificing either performance or safety …

Acoustic-based online monitoring of cooling fan malfunction in air-forced transformers using learning techniques

R Nematirad, M Behrang, A Pahwa - IEEE Access, 2024 - ieeexplore.ieee.org
Cooling fans are one of the critical components of air-forced dry-type transformers for
regulating internal temperatures. Therefore, effective malfunction detection is crucial to …

Towards a competitive end-to-end speech recognition for CHiME-6 dinner party transcription

A Andrusenko, A Laptev, I Medennikov - arXiv preprint arXiv:2004.10799, 2020 - arxiv.org
While end-to-end ASR systems have proven competitive with the conventional hybrid
approach, they are prone to accuracy degradation when it comes to noisy and low-resource …

Entangled: A multi-modal, multi-user interactive instrument in virtual 3d space using the smartphone for gesture control

M Lee - 2021 - nime.pubpub.org
This paper introduces Entangled, a multi-modal instrument in virtual 3D space with sound, graphics,
and a smartphone-based gestural interface for multiple users. Within the same …

Data augmentation methods for end-to-end speech recognition on distant-talk scenarios

E Tsunoo, K Shibata, C Narisetty, Y Kashiwagi… - arXiv preprint arXiv …, 2021 - arxiv.org
Although end-to-end automatic speech recognition (E2E ASR) has achieved great
performance in tasks that have numerous paired data, it is still challenging to make E2E …

Class-specific early exit design methodology for convolutional neural networks

V Bonato, CS Bouganis - Applied Soft Computing, 2021 - Elsevier
Convolutional Neural Network-based (CNN) inference is a demanding
computational task where a long sequence of operations is applied to an input as dictated …

Improving automatic speech recognition utilizing audio-codecs for data augmentation

N Hailu, I Siegert, A Nürnberger - 2020 IEEE 22nd International …, 2020 - ieeexplore.ieee.org
Training end-to-end automatic speech recognition models requires a large amount of
labeled speech data. This goal is challenging for languages with fewer resources. In …

Coherent Digital Multimodal Instrument Design and the Evaluation of Crossmodal Correspondence

M Lee - 2023 - search.proquest.com
The rapidly growing availability of advanced hardware and software is
opening up new opportunities for digital creation every day. This situation provides great …

End-to-end speech recognition based on multi-task learning with thresholded BPE-dropout.

马建, 朵琳, 韦贵香, 唐剑 - Journal of Jilin University …, 2024 - search.ebscohost.com
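To address the out-of-vocabulary word problem in speech recognition tasks, a multi-task learning speech recognition
method using BPE-dropout with a threshold is proposed. The method adopts a stochastic byte pair encoding algorithm and introduces a character-count threshold strategy when forming subwords …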