Spex: Multi-scale time domain speaker extraction network

C Xu, W Rao, ES Chng, H Li - IEEE/ACM transactions on audio …, 2020 - ieeexplore.ieee.org
Speaker extraction aims to mimic humans' selective auditory attention by extracting a target
speaker's voice from a multi-talker environment. It is common to perform the extraction in …

Spex+: A complete time domain speaker extraction network

M Ge, C Xu, L Wang, ES Chng, J Dang, H Li - arXiv preprint arXiv …, 2020 - arxiv.org
Speaker extraction aims to extract the target speech signal from a multi-talker environment
given a target speaker's reference speech. We recently proposed a time-domain solution …

Towards modelling active sound localisation based on Bayesian inference in a static environment

G McLachlan, P Majdak, J Reijniers… - Acta …, 2021 - acta-acustica.edpsciences.org
Over the decades, Bayesian statistical inference has become a staple technique for
modelling human multisensory perception. Many studies have successfully shown how …

Mechanisms of auditory masking in marine mammals

BK Branstetter, JM Sills - Animal Cognition, 2022 - Springer
Anthropogenic noise is an increasing threat to marine mammals that rely on sound for
communication, navigation, detecting prey and predators, and finding mates. Auditory …

Concurrent temporal channels for auditory processing: Oscillatory neural entrainment reveals segregation of function at different scales

X Teng, X Tian, J Rowland, D Poeppel - PLoS biology, 2017 - journals.plos.org
Natural sounds convey perceptually relevant information over multiple timescales, and the
necessary extraction of multi-timescale information requires the auditory system to work over …

[HTML][HTML] Successes and critical failures of neural networks in capturing human-like speech recognition

F Adolfi, JS Bowers, D Poeppel - Neural Networks, 2023 - Elsevier
Natural and artificial audition can in principle acquire different solutions to a given problem.
The constraints of the task, however, can nudge the cognitive science and engineering of …

The Weber–Fechner law: a misnomer that persists but that should go away.

D Algom - Psychological review, 2021 - psycnet.apa.org
Abstract The term “Weber–Fechner law” is arguably the most widely used misnomer in
psychological science. The unification reflects a failure to appreciate the logical …

Theta band oscillations reflect more than entrainment: behavioral and neural evidence demonstrates an active chunking process

X Teng, X Tian, K Doelling… - European Journal of …, 2018 - Wiley Online Library
Parsing continuous acoustic streams into perceptual units is fundamental to auditory
perception. Previous studies have uncovered a cortical entrainment mechanism in the delta …

Theta and gamma bands encode acoustic dynamics over wide-ranging timescales

X Teng, D Poeppel - Cerebral cortex, 2020 - academic.oup.com
Natural sounds contain acoustic dynamics ranging from tens to hundreds of milliseconds.
How does the human auditory system encode acoustic information over wide-ranging …

Asymmetric sampling in human auditory cortex reveals spectral processing hierarchy

J Giroud, A Trébuchon, D Schön, P Marquis… - PLoS …, 2020 - journals.plos.org
Speech perception is mediated by both left and right auditory cortices but with differential
sensitivity to specific acoustic information contained in the speech signal. A detailed …