Spatial audio signal processing for binaural reproduction of recorded acoustic scenes–review and challenges
Spatial audio has been studied for several decades, but has seen much renewed interest
recently due to advances in both software and hardware for capture and playback, and the …
recently due to advances in both software and hardware for capture and playback, and the …
Revise: Self-supervised speech resynthesis with visual input for universal and generalized speech regeneration
Prior works on improving speech quality with visual input typically study each type of
auditory distortion separately (eg, separation, inpainting, video-to-speech) and present …
auditory distortion separately (eg, separation, inpainting, video-to-speech) and present …
Revise: Self-supervised speech resynthesis with visual input for universal and generalized speech enhancement
Prior works on improving speech quality with visual input typically study each type of
auditory distortion separately (eg, separation, inpainting, video-to-speech) and present …
auditory distortion separately (eg, separation, inpainting, video-to-speech) and present …
Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation
Y Hsu, MR Bai - arXiv preprint arXiv:2311.12706, 2023 - arxiv.org
Audio Telepresence (AT) aims to create an immersive experience of the audio scene at the
far end for the user (s) at the near end. The application of AT could encompass scenarios …
far end for the user (s) at the near end. The application of AT could encompass scenarios …
Model-matching principle applied to the design of an array-based all-neural binaural rendering system for audio telepresence
Y Hsu, C Ma, MR Bai - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Telepresence aims to create an immersive but virtual experience of the audio and visual
scene at the far-end for users at the near-end. In this contribution, we propose an array …
scene at the far-end for users at the near-end. In this contribution, we propose an array …
Multi-Channel to Multi-Channel Noise Reduction and Reverberant Speech Preservation in Time-Varying Acoustic Scenes for Binaural Reproduction
M Lugasi, J Donley, A Menon… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Real-life acoustic scenes may be recorded with microphone arrays for spatial audio
applications, especially for the purpose of reproducing binaural signals for headphone …
applications, especially for the purpose of reproducing binaural signals for headphone …
Effortless Polite Telepresence using Intention Recognition
Telepresence technology creates the opportunity for people that were traditionally left out of
the workforce to work remotely. In the service industry, a pool of novice remote workers …
the workforce to work remotely. In the service industry, a pool of novice remote workers …
Enhancing Teleoperated Robot Customer Service through Speech Monitoring and Filtering
In this paper, we propose a system that supports operators who provide services to
customers using teleoperated robots. We observed that unprofessional or lazy operators of …
customers using teleoperated robots. We observed that unprofessional or lazy operators of …
Performance Analysis Of Binaural Signal Matching (BSM) in the Time-Frequency Domain
The capture and reproduction of spatial audio is becoming increasingly popular, with the
mushrooming of applications in teleconferencing, entertainment and virtual reality. Many …
mushrooming of applications in teleconferencing, entertainment and virtual reality. Many …
[PDF][PDF] DARE-Net: Speech dereverberation and room impulse response estimation
Speech enhancement is a common task for video calling, automatic speech recognition,
speech communications, home assistants, and audio forensics [1, 2, 3]. One component of …
speech communications, home assistants, and audio forensics [1, 2, 3]. One component of …