Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

被引用次数：261 相关文章所有 9 个版本

[PDF] academia.edu

Speaker recognition by machines and humans: A tutorial review

JHL Hansen, T Hasan - IEEE Signal processing magazine, 2015 - ieeexplore.ieee.org

Identifying a person by his or her voice is an important human trait most take for granted in
natural human-to-human interaction/communication. Speaking to someone over the …

被引用次数：768 相关文章所有 6 个版本

[PDF] mlr.press

Agnostic federated learning

M Mohri, G Sivek, AT Suresh - International Conference on …, 2019 - proceedings.mlr.press

A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …

被引用次数：931 相关文章所有 6 个版本

[PDF] thecvf.com

Scene text telescope: Text-focused scene image super-resolution

J Chen, B Li, X Xue - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com

Image super-resolution, which is often regarded as a preprocessing procedure of scene text
recognition, aims to recover the realistic features from a low-resolution text image. It has …

被引用次数：103 相关文章所有 7 个版本

[PDF] arxiv.org

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

被引用次数：389 相关文章所有 10 个版本

[PDF] arxiv.org

TED-LIUM 3: Twice as much data and corpus repartition for experiments on speaker adaptation

F Hernandez, V Nguyen, S Ghannay… - Speech and Computer …, 2018 - Springer

In this paper, we present TED-LIUM release 3 corpus (TED-LIUM 3 is available on
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …

被引用次数：312 相关文章所有 7 个版本

[PDF] drwuz.com

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier

While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

被引用次数：711 相关文章所有 13 个版本

[PDF] apa.org

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org

Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These mappings are not only probabilistic, but change depending on the …

被引用次数：673 相关文章所有 13 个版本

[HTML] sciencedirect.com

[HTML][HTML] Optical sensors and machine learning algorithms in sensor-based material flow characterization for mechanical recycling processes: A systematic literature …

N Kroell, X Chen, K Greiff, A Feil - Waste Management, 2022 - Elsevier

Digital technologies hold enormous potential for improving the performance of future-
generation sorting and processing plants; however, this potential remains largely untapped …

被引用次数：51 相关文章所有 8 个版本

[PDF] ucl.ac.uk

Deepear: robust smartphone audio sensing in unconstrained acoustic environments using deep learning

ND Lane, P Georgiev, L Qendro - … of the 2015 ACM international joint …, 2015 - dl.acm.org

Microphones are remarkably powerful sensors of human behavior and context. However,
audio sensing is highly susceptible to wild fluctuations in accuracy when used in diverse …

被引用次数：408 相关文章所有 7 个版本