[HTML] Overview of geometrical room acoustic modeling techniques

L Savioja, UP Svensson - The Journal of the Acoustical Society of …, 2015 - pubs.aip.org
Computerized room acoustics modeling has been practiced for almost 50 years up to date.
These modeling techniques play an important role in room acoustic design nowadays, often …

Multi-modal multi-channel target speech separation

R Gu, SX Zhang, Y Xu, L Chen… - IEEE Journal of …, 2020 - ieeexplore.ieee.org
Target speech separation refers to extracting a target speaker's voice from an overlapped
audio of simultaneous talkers. Previously the use of visual modality for target speech …

gpuRIR: A python library for room impulse response simulation with GPU acceleration

D Diaz-Guerra, A Miguel, JR Beltran - Multimedia Tools and Applications, 2021 - Springer
The Image Source Method (ISM) is one of the most employed techniques to
calculate acoustic Room Impulse Responses (RIRs), however, its computational complexity …

Towards unified all-neural beamforming for time and frequency domain speech separation

R Gu, SX Zhang, Y Zou, D Yu - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Recently, frequency domain all-neural beamforming methods have achieved remarkable
progress for multichannel speech separation. In parallel, the integration of time domain …

[Book] Sound capture and processing: practical approaches

IJ Tashev - 2009 - books.google.com
Provides state-of-the-art algorithms for sound capture, processing and enhancement. Sound
Capture and Processing: Practical Approaches covers the digital signal processing …

Diffuse reverberation model for efficient image-source simulation of room impulse responses

EA Lehmann, AM Johansson - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
In many research fields of engineering and acoustics, the image-source model represents
one of the most popular tools for the simulation of sound fields in virtual reverberant …

[HTML] A late fusion deep neural network for robust speaker identification using raw waveforms and gammatone cepstral coefficients

D Salvati, C Drioli, GL Foresti - Expert Systems with Applications, 2023 - Elsevier
Speaker identification aims at determining the speaker identity by analyzing his voice
characteristics, and relies typically on statistical models or machine learning techniques …

[Book] Acústica de salas: projeto e modelagem

E Brandão - 2018 - books.google.com
This book covers the principles for modeling and characterizing sound propagation
in rooms, as well as the fundamentals for developing designs of enclosures …

A reverberation-time-aware approach to speech dereverberation based on deep neural networks

B Wu, K Li, M Yang, CH Lee - IEEE/ACM transactions on audio …, 2016 - ieeexplore.ieee.org
A reverberation-time-aware deep-neural-network (DNN)-based speech dereverberation
framework is proposed to handle a wide range of reverberation times. There are three key …

Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain

R Gu, SX Zhang, Y Zou, D Yu - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org
To date, mainstream target speech separation (TSS) approaches are formulated to estimate
the complex ratio mask (cRM) of target speech in time-frequency domain under supervised …