Self-supervised speech representation learning: A review
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …
necessitated the building of specialist models for individual tasks and application scenarios …
Speaker recognition by machines and humans: A tutorial review
JHL Hansen, T Hasan - IEEE Signal processing magazine, 2015 - ieeexplore.ieee.org
Identifying a person by his or her voice is an important human trait most take for granted in
natural human-to-human interaction/communication. Speaking to someone over the …
natural human-to-human interaction/communication. Speaking to someone over the …
Agnostic federated learning
A key learning scenario in large-scale applications is that of federated learning, where a
centralized model is trained based on data originating from a large number of clients. We …
centralized model is trained based on data originating from a large number of clients. We …
Scene text telescope: Text-focused scene image super-resolution
Image super-resolution, which is often regarded as a preprocessing procedure of scene text
recognition, aims to recover the realistic features from a low-resolution text image. It has …
recognition, aims to recover the realistic features from a low-resolution text image. It has …
Deep learning for environmentally robust speech recognition: An overview of recent developments
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …
research topic for automatic speech recognition but still remains an important challenge …
TED-LIUM 3: Twice as much data and corpus repartition for experiments on speaker adaptation
In this paper, we present TED-LIUM release 3 corpus (TED-LIUM 3 is available on
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …
https://lium. univ-lemans. fr/ted-lium3/) dedicated to speech recognition in English, which …
Spoofing and countermeasures for speaker verification: A survey
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …
the technology can be susceptible to malicious spoofing attacks. The research community …
Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.
DF Kleinschmidt, TF Jaeger - Psychological review, 2015 - psycnet.apa.org
Successful speech perception requires that listeners map the acoustic signal to linguistic
categories. These mappings are not only probabilistic, but change depending on the …
categories. These mappings are not only probabilistic, but change depending on the …
[HTML][HTML] Optical sensors and machine learning algorithms in sensor-based material flow characterization for mechanical recycling processes: A systematic literature …
Digital technologies hold enormous potential for improving the performance of future-
generation sorting and processing plants; however, this potential remains largely untapped …
generation sorting and processing plants; however, this potential remains largely untapped …
Deepear: robust smartphone audio sensing in unconstrained acoustic environments using deep learning
Microphones are remarkably powerful sensors of human behavior and context. However,
audio sensing is highly susceptible to wild fluctuations in accuracy when used in diverse …
audio sensing is highly susceptible to wild fluctuations in accuracy when used in diverse …