Robust automatic speech recognition: a bridge to practical applications

R Dias, A Torkamani - Genome medicine, 2019 - Springer

Artificial intelligence (AI) is the development of computer systems that are able to perform
tasks that normally require human intelligence. Advances in AI software and hardware …

Cited by 335 - Related articles - All 13 versions

[PDF] mdpi.com

Emotion recognition using different sensors, emotion models, methods and datasets: A comprehensive review

Y Cai, X Li, J Li - Sensors, 2023 - mdpi.com

In recent years, the rapid development of sensors and information technology has made it
possible for machines to recognize and analyze human emotions. Emotion recognition is an …

Cited by 43 - Related articles - All 8 versions

[PDF] arxiv.org

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - arXiv preprint arXiv …, 2020 - arxiv.org

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …

Cited by 306 - Related articles - All 7 versions

[PDF] arxiv.org

The fifth 'CHiME' speech separation and recognition challenge: dataset, task and baselines

J Barker, S Watanabe, E Vincent, J Trmal - arXiv preprint arXiv …, 2018 - arxiv.org

The CHiME challenge series aims to advance robust automatic speech recognition (ASR)
technology by promoting research at the interface of speech and language processing …

Cited by 406 - Related articles - All 11 versions

[PDF] cambridge.org

Digital language learning (DLL): Insights from behavior, cognition, and the brain

P Li, YJ Lan - Bilingualism: Language and Cognition, 2022 - cambridge.org

How can we leverage digital technologies to enhance language learning and bilingual
representation? In this digital era, our theories and practices for the learning and teaching of …

Cited by 109 - Related articles - All 10 versions

[PDF] hal.science

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier

Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …

Cited by 410 - Related articles - All 16 versions

[PDF] ieee.org

Progressive tandem learning for pattern recognition with deep spiking neural networks

J Wu, C Xu, X Han, D Zhou, M Zhang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Spiking neural networks (SNNs) have shown clear advantages over traditional artificial
neural networks (ANNs) for low latency and high computational efficiency, due to their event …

Cited by 114 - Related articles - All 7 versions

[PDF] ieee.org

Spex: Multi-scale time domain speaker extraction network

C Xu, W Rao, ES Chng, H Li - IEEE/ACM transactions on audio …, 2020 - ieeexplore.ieee.org

Speaker extraction aims to mimic humans' selective auditory attention by extracting a target
speaker's voice from a multi-talker environment. It is common to perform the extraction in …

Cited by 157 - Related articles - All 6 versions

[PDF] arxiv.org

Far-field automatic speech recognition

R Haeb-Umbach, J Heymann, L Drude… - Proceedings of the …, 2020 - ieeexplore.ieee.org

The machine recognition of speech spoken at a distance from the microphones, known as
far-field automatic speech recognition (ASR), has received a significant increase in attention …

Cited by 101 - Related articles - All 8 versions

[PDF] arxiv.org

Audio-visual speech enhancement using multimodal deep convolutional neural networks

JC Hou, SS Wang, YH Lai, Y Tsao… - … on Emerging Topics …, 2018 - ieeexplore.ieee.org

Speech enhancement (SE) aims to reduce noise in speech signals. Most SE techniques
focus only on addressing audio information. In this paper, inspired by multimodal learning …

Cited by 242 - Related articles - All 12 versions