Spex: Multi-scale time domain speaker extraction network
Speaker extraction aims to mimic humans' selective auditory attention by extracting a target
speaker's voice from a multi-talker environment. It is common to perform the extraction in …
speaker's voice from a multi-talker environment. It is common to perform the extraction in …
Spex+: A complete time domain speaker extraction network
Speaker extraction aims to extract the target speech signal from a multi-talker environment
given a target speaker's reference speech. We recently proposed a time-domain solution …
given a target speaker's reference speech. We recently proposed a time-domain solution …
Towards modelling active sound localisation based on Bayesian inference in a static environment
G McLachlan, P Majdak, J Reijniers… - Acta …, 2021 - acta-acustica.edpsciences.org
Over the decades, Bayesian statistical inference has become a staple technique for
modelling human multisensory perception. Many studies have successfully shown how …
modelling human multisensory perception. Many studies have successfully shown how …
Mechanisms of auditory masking in marine mammals
BK Branstetter, JM Sills - Animal Cognition, 2022 - Springer
Anthropogenic noise is an increasing threat to marine mammals that rely on sound for
communication, navigation, detecting prey and predators, and finding mates. Auditory …
communication, navigation, detecting prey and predators, and finding mates. Auditory …
Concurrent temporal channels for auditory processing: Oscillatory neural entrainment reveals segregation of function at different scales
Natural sounds convey perceptually relevant information over multiple timescales, and the
necessary extraction of multi-timescale information requires the auditory system to work over …
necessary extraction of multi-timescale information requires the auditory system to work over …
[HTML][HTML] Successes and critical failures of neural networks in capturing human-like speech recognition
Natural and artificial audition can in principle acquire different solutions to a given problem.
The constraints of the task, however, can nudge the cognitive science and engineering of …
The constraints of the task, however, can nudge the cognitive science and engineering of …
The Weber–Fechner law: a misnomer that persists but that should go away.
D Algom - Psychological review, 2021 - psycnet.apa.org
Abstract The term “Weber–Fechner law” is arguably the most widely used misnomer in
psychological science. The unification reflects a failure to appreciate the logical …
psychological science. The unification reflects a failure to appreciate the logical …
Theta band oscillations reflect more than entrainment: behavioral and neural evidence demonstrates an active chunking process
Parsing continuous acoustic streams into perceptual units is fundamental to auditory
perception. Previous studies have uncovered a cortical entrainment mechanism in the delta …
perception. Previous studies have uncovered a cortical entrainment mechanism in the delta …
Theta and gamma bands encode acoustic dynamics over wide-ranging timescales
Natural sounds contain acoustic dynamics ranging from tens to hundreds of milliseconds.
How does the human auditory system encode acoustic information over wide-ranging …
How does the human auditory system encode acoustic information over wide-ranging …
Asymmetric sampling in human auditory cortex reveals spectral processing hierarchy
Speech perception is mediated by both left and right auditory cortices but with differential
sensitivity to specific acoustic information contained in the speech signal. A detailed …
sensitivity to specific acoustic information contained in the speech signal. A detailed …