Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Speaker recognition by machines and humans: A tutorial review

JHL Hansen, T Hasan - IEEE Signal processing magazine, 2015 - ieeexplore.ieee.org
Identifying a person by his or her voice is an important human trait most take for granted in
natural human-to-human interaction/communication. Speaking to someone over the …

Agnostic federated learning

M Mohri, G Sivek, AT Suresh - International Conference on …, 2019 - proceedings.mlr.press
A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …

Scene text telescope: Text-focused scene image super-resolution

J Chen, B Li, X Xue - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
Image super-resolution, which is often regarded as a preprocessing procedure of scene text
recognition, aims to recover the realistic features from a low-resolution text image. It has …

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

TED-LIUM 3: Twice as much data and corpus repartition for experiments on speaker adaptation

F Hernandez, V Nguyen, S Ghannay… - Speech and Computer …, 2018 - Springer
In this paper, we present TED-LIUM release 3 corpus (TED-LIUM 3 is available on
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.

DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org
Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These mappings are not only probabilistic, but change depending on the …

[HTML][HTML] Optical sensors and machine learning algorithms in sensor-based material flow characterization for mechanical recycling processes: A systematic literature …

N Kroell, X Chen, K Greiff, A Feil - Waste Management, 2022 - Elsevier
Digital technologies hold enormous potential for improving the performance of future-
generation sorting and processing plants; however, this potential remains largely untapped …

Deepear: robust smartphone audio sensing in unconstrained acoustic environments using deep learning

ND Lane, P Georgiev, L Qendro - … of the 2015 ACM international joint …, 2015 - dl.acm.org
Microphones are remarkably powerful sensors of human behavior and context. However,
audio sensing is highly susceptible to wild fluctuations in accuracy when used in diverse …