Novel-view acoustic synthesis from 3D reconstructed rooms

B Ahn, K Yang, B Hamilton, J Sheaffer… - arXiv preprint arXiv …, 2023 - arxiv.org
We investigate the benefit of combining blind audio recordings with 3D scene information for
novel-view acoustic synthesis. Given audio recordings from 2-4 microphones and the 3D …

Can Large Language Models Understand Spatial Audio?

C Tang, W Yu, G Sun, X Chen, T Tan, W Li… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper explores enabling large language models (LLMs) to understand spatial
information from multichannel audio, a skill currently lacking in auditory LLMs. By leveraging …

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

J Coldenhoff, A Harper, P Kendrick… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Previous methods for predicting room acoustic parameters and speech quality metrics have
focused on the single-channel case, where room acoustics and Mean Opinion Score (MOS) …