Spatial audio signal processing for binaural reproduction of recorded acoustic scenes – review and challenges

B Rafaely, V Tourbabin, E Habets… - Acta …, 2022 - acta-acustica.edpsciences.org
Spatial audio has been studied for several decades, but has seen much renewed interest
recently due to advances in both software and hardware for capture and playback, and the …

ReVISE: Self-supervised speech resynthesis with visual input for universal and generalized speech regeneration

WN Hsu, T Remez, B Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Prior works on improving speech quality with visual input typically study each type of
auditory distortion separately (e.g., separation, inpainting, video-to-speech) and present …

ReVISE: Self-supervised speech resynthesis with visual input for universal and generalized speech enhancement

WN Hsu, T Remez, B Shi, J Donley, Y Adi - arXiv preprint arXiv …, 2022 - arxiv.org
Prior works on improving speech quality with visual input typically study each type of
auditory distortion separately (e.g., separation, inpainting, video-to-speech) and present …

Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation

Y Hsu, MR Bai - arXiv preprint arXiv:2311.12706, 2023 - arxiv.org
Audio Telepresence (AT) aims to create an immersive experience of the audio scene at the
far end for the user(s) at the near end. The application of AT could encompass scenarios …

Model-matching principle applied to the design of an array-based all-neural binaural rendering system for audio telepresence

Y Hsu, C Ma, MR Bai - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Telepresence aims to create an immersive but virtual experience of the audio and visual
scene at the far-end for users at the near-end. In this contribution, we propose an array …

Multi-Channel to Multi-Channel Noise Reduction and Reverberant Speech Preservation in Time-Varying Acoustic Scenes for Binaural Reproduction

M Lugasi, J Donley, A Menon… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Real-life acoustic scenes may be recorded with microphone arrays for spatial audio
applications, especially for the purpose of reproducing binaural signals for headphone …

Effortless Polite Telepresence using Intention Recognition

M Daneshmand, J Even, T Kanda - ACM Transactions on Human-Robot …, 2024 - dl.acm.org
Telepresence technology creates the opportunity for people that were traditionally left out of
the workforce to work remotely. In the service industry, a pool of novice remote workers …

Enhancing Teleoperated Robot Customer Service through Speech Monitoring and Filtering

K Yamada, J Even, T Kanda - 2023 IEEE/RSJ International …, 2023 - ieeexplore.ieee.org
In this paper, we propose a system that supports operators who provide services to
customers using teleoperated robots. We observed that unprofessional or lazy operators of …

Performance Analysis Of Binaural Signal Matching (BSM) in the Time-Frequency Domain

A Berger, V Tourbabin, J Donley, Z Ben-Hur… - arXiv preprint arXiv …, 2023 - arxiv.org
The capture and reproduction of spatial audio is becoming increasingly popular, with the
mushrooming of applications in teleconferencing, entertainment and virtual reality. Many …

DARE-Net: Speech dereverberation and room impulse response estimation

J Donley, P Calamia - 2022 - cs230.stanford.edu
Speech enhancement is a common task for video calling, automatic speech recognition,
speech communications, home assistants, and audio forensics [1, 2, 3]. One component of …