CNN-based multichannel end-to-end speech recognition for everyday home environments

[HTML] Intelligent Quran Recitation Recognition and Verification: Research Trends and Open Issues

SS Alrumiah, AA Al-Shargabi - Arabian Journal for Science and …, 2023 - Springer

Muslims aim to recite and memorize the Holy Quran correctly. However, traditional recitation
verification approaches depend on humans who may not be available. On the other hand …

Cited by 7 Related articles All 2 versions

[PDF] hal.science

Detection of precursors of combustion instability using convolutional recurrent neural networks

A Cellier, CJ Lapeyre, G Öztarlik, T Poinsot… - Combustion and …, 2021 - Elsevier

Many combustors are prone to Thermoacoustic Instabilities (TAI). Being able to avoid TAI is
mandatory to efficiently operate a system without sacrificing neither performance nor safety …

Cited by 31 Related articles All 8 versions

[PDF] ieee.org

Acoustic-based online monitoring of cooling fan malfunction in air-forced transformers using learning techniques

R Nematirad, M Behrang, A Pahwa - IEEE Access, 2024 - ieeexplore.ieee.org

Cooling fans are one of the critical components of air-forced dry-type transformers for
regulating internal temperatures. Therefore, effective malfunction detection is crucial to …

Cited by 2 Related articles All 4 versions

[PDF] arxiv.org

Towards a competitive end-to-end speech recognition for CHiME-6 dinner party transcription

A Andrusenko, A Laptev, I Medennikov - arXiv preprint arXiv:2004.10799, 2020 - arxiv.org

While end-to-end ASR systems have proven competitive with the conventional hybrid
approach, they are prone to accuracy degradation when it comes to noisy and low-resource …

Cited by 20 Related articles All 9 versions

[PDF] pubpub.org

Entangled: A multi-modal, multi-user interactive instrument in virtual 3d space using the smartphone for gesture control

M Lee - 2021 - nime.pubpub.org

In this paper, Entangled, a multi-modal instrument in virtual 3D space with sound, graphics,
and the smartphone-based gestural interface for multi-user is introduced. Within the same …

Cited by 12 Related articles All 3 versions

[PDF] arxiv.org

Data augmentation methods for end-to-end speech recognition on distant-talk scenarios

E Tsunoo, K Shibata, C Narisetty, Y Kashiwagi… - arXiv preprint arXiv …, 2021 - arxiv.org

Although end-to-end automatic speech recognition (E2E ASR) has achieved great
performance in tasks that have numerous paired data, it is still challenging to make E2E …

Cited by 10 Related articles All 7 versions

Class-specific early exit design methodology for convolutional neural networks

V Bonato, CS Bouganis - Applied Soft Computing, 2021 - Elsevier

Abstract Convolutional Neural Network-based (CNN) inference is a demanding
computational task where a long sequence of operations is applied to an input as dictated …

Cited by 10 Related articles All 4 versions

Improving automatic speech recognition utilizing audio-codecs for data augmentation

N Hailu, I Siegert, A Nürnberger - 2020 IEEE 22nd International …, 2020 - ieeexplore.ieee.org

To train end-to-end automatic speech recognition models, it requires a large amount of
labeled speech data. This goal is challenging for languages with fewer resources. In …

Cited by 9 Related articles

[PDF] escholarship.org

[BOOK][B] Coherent Digital Multimodal Instrument Design and the Evaluation of Crossmodal Correspondence

M Lee - 2023 - search.proquest.com

The rapid development of the current availability of advanced hardware and software is
opening up new opportunities for digital creation every day. This situation provides great …

Cited by 2 Related articles All 2 versions

End-to-end speech recognition based on multi-task learning with thresholded BPE-dropout

Ma Jian, Duo Lin, Wei Guixiang, Tang Jian - Journal of Jilin University …, 2024 - search.ebscohost.com

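To address the out-of-vocabulary word problem in speech recognition tasks, a multi-task learning
speech recognition method based on thresholded BPE-dropout is proposed. The method uses a stochastic byte pair encoding algorithm and introduces a character-count threshold strategy when forming subwords …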