[PDF][PDF] An overview of automatic audio segmentation

T Theodorou, I Mporas… - International Journal of …, 2014 - scholar.archive.org
In this report we present an overview of the approaches and techniques that are used in the
task of automatic audio segmentation. Audio segmentation aims to find changing points in …

Automatic speech summarisation: A scoping review

D Rezazadegan, S Berkovsky, JC Quiroz… - arXiv preprint arXiv …, 2020 - arxiv.org
Speech summarisation techniques take human speech as input and then output an
abridged version as text or speech. Speech summarisation has applications in many …

Symbolic and statistical learning approaches to speech summarization: A scoping review

D Rezazadegan, S Berkovsky, JC Quiroz… - Computer Speech & …, 2022 - Elsevier
Speech summarization techniques take human speech as input and then output an
abridged version as text or speech. Speech summarization has applications in many …

[图书][B] Self-learning speaker identification: a system for enhanced speech recognition

T Herbig, F Gerl, W Minker - 2011 - books.google.com
Current speech recognition systems are based on speaker independent speech models and
suffer from inter-speaker variations in speech signal characteristics. This work develops an …

The real-time UML standard: definition and application

B Selic - Proceedings 2002 Design, Automation and Test in …, 2002 - ieeexplore.ieee.org
This paper describes briefly the objectives, content, and usage of a real-time UML profile
that has been standardized by the Object Management Group. This profile defines a …

Context-based environmental audio event recognition for scene understanding

T Lu, G Wang, F Su - Multimedia Systems, 2015 - Springer
Automatic audio content recognition has attracted an increasing attention for developing
multimedia systems, for which the most popular approaches combine frame-based features …

Audiotory movie summarization by detecting scene changes and sound events

T Lu, Y Weng, G Wang - 2014 22nd International Conference …, 2014 - ieeexplore.ieee.org
A novel movie audio summarization framework is presented, which consists of three
processing levels, namely, low-level audio feature extraction, mid-level audio event …

Adaptive systems for unsupervised speaker tracking and speech recognition

T Herbig, F Gerl, W Minker, R Haeb-Umbach - Evolving Systems, 2011 - Springer
Speech recognition offers an intuitive and convenient interface to control technical devices.
Improvements achieved through ongoing research activities enable the user to handle …

Towards Chapterisation of Podcasts Detection of Host and Structuring Questions in Radio Transcripts

M Piguet - 2024 - infoscience.epfl.ch
This Master thesis investigates the application of Bidirectional Encoder Representations
from Transformers (BERT) on podcast to identify the host and detect structuring questions …

Data-driven audio feature space clustering for automatic sound recognition in radio broadcast news

T Theodorou, I Mporas, A Lazaridis… - International Journal on …, 2017 - World Scientific
Aiming to an automatic sound recognizer for radio broadcasting events, a methodology of
clustering the audio feature space using the discrimination ability of the audio descriptors as …