Pattern analysis based acoustic signal processing: a survey of the state-of-art

J Chaki - International Journal of Speech Technology, 2021 - Springer
Audio signal processing is the most challenging field in the current era for an analysis of an
audio signal. Audio signal classification (ASC) comprises generating appropriate features …

Investigating the effects of training set synthesis for audio segmentation of radio broadcast

S Venkatesh, D Moffat, ER Miranda - Electronics, 2021 - mdpi.com
Music and speech detection provides us valuable information regarding the nature of
content in broadcast audio. It helps detect acoustic regions that contain speech, voice over …

Artificially synthesising data for audio classification and segmentation to improve speech and music detection in radio broadcast

S Venkatesh, D Moffat, A Kirke… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Segmenting audio into homogeneous sections such as music and speech helps us
understand the content of audio. It is useful as a pre-processing step to index, store, and …

Dialogue Enhancement with MPEG-H Audio: An update on Technology and Adoption

D Rieger, C Simon, M Torcoli, H Fuchs - Audio Engineering Society …, 2023 - aes.org
Difficulties in following speech on TV due to loud background sounds are a common issue in
broadcasting. Object-based audio (OBA) systems like MPEG-H Audio can solve this problem …

Augmenting TV viewing using acoustically transparent auditory headsets

M McGill, F Mathis, M Khamis… - Proceedings of the 2020 …, 2020 - dl.acm.org
This paper explores how acoustically transparent auditory headsets can improve TV viewing
by intermixing headset and TV audio, facilitating personal, private auditory enhancements …

Casualty accessible and enhanced (A&E) audio: trialling object-based accessible TV audio

L Ward, M Paradis, B Shirley, L Russon… - … Society Convention 147, 2019 - aes.org
Casualty Accessible and Enhanced (A&E) Audio is the first public trial of accessible audio
technology using a narrative importance approach. This trial allows viewers to personalize …

Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech

M Torcoli, A Freke-Morin… - Journal of the …, 2019 - salford-repository.worktribe.com
In audio production, background ducking facilitates speech intelligibility while allowing the
background to fulfill its purpose, e.g., to create ambience, set the mood, or convey semantic …

Audio description personalisation

P Orero - The Routledge Handbook of Audio Description, 2022 - taylorfrancis.com
Nowadays most audio description components may be altered with a view to achieving a
higher level of interaction with the audience, the venue requirements, the media genres and …

Dialogue Understandability: Why are we streaming movies with subtitles?

H Becerra, A Ragano, D Debnath, A Ullah… - arXiv preprint arXiv …, 2024 - arxiv.org
Watching movies and TV shows with subtitles enabled is not simply down to audibility or
speech intelligibility. A variety of evolving factors related to technological advances, cinema …

Loudness differences for voice-over-voice audio in TV and streaming

D Geary, M Torcoli, J Paulus, C Simon… - Journal of the Audio …, 2020 - aes.org
Voice-over-Voice (VoV) is a common mixing practice observed in news reports and
documentaries, where a foreground voice is mixed on top of a background voice, e.g., to …