Significance of anchor speaker segments for constructing extractive audio summaries of broadcast...

T Theodorou, I Mporas… - International Journal of …, 2014 - scholar.archive.org

In this report we present an overview of the approaches and techniques that are used in the
task of automatic audio segmentation. Audio segmentation aims to find changing points in …

Cited by 76 Related articles

Automatic speech summarisation: A scoping review

D Rezazadegan, S Berkovsky, JC Quiroz… - arXiv preprint arXiv …, 2020 - arxiv.org

Speech summarisation techniques take human speech as input and then output an
abridged version as text or speech. Speech summarisation has applications in many …

Cited by 12 Related articles All 4 versions

Symbolic and statistical learning approaches to speech summarization: A scoping review

D Rezazadegan, S Berkovsky, JC Quiroz… - Computer Speech & …, 2022 - Elsevier

Speech summarization techniques take human speech as input and then output an
abridged version as text or speech. Speech summarization has applications in many …

Cited by 3 Related articles All 6 versions

[BOOK][B] Self-learning speaker identification: a system for enhanced speech recognition

T Herbig, F Gerl, W Minker - 2011 - books.google.com

Current speech recognition systems are based on speaker independent speech models and
suffer from inter-speaker variations in speech signal characteristics. This work develops an …

Cited by 24 Related articles All 7 versions

The real-time UML standard: definition and application

B Selic - Proceedings 2002 Design, Automation and Test in …, 2002 - ieeexplore.ieee.org

This paper describes briefly the objectives, content, and usage of a real-time UML profile
that has been standardized by the Object Management Group. This profile defines a …

Cited by 34 Related articles All 20 versions

Context-based environmental audio event recognition for scene understanding

T Lu, G Wang, F Su - Multimedia Systems, 2015 - Springer

Automatic audio content recognition has attracted an increasing attention for developing
multimedia systems, for which the most popular approaches combine frame-based features …

Cited by 11 Related articles All 5 versions

Audiotory movie summarization by detecting scene changes and sound events

T Lu, Y Weng, G Wang - 2014 22nd International Conference …, 2014 - ieeexplore.ieee.org

A novel movie audio summarization framework is presented, which consists of three
processing levels, namely, low-level audio feature extraction, mid-level audio event …

Cited by 4 Related articles All 4 versions

Adaptive systems for unsupervised speaker tracking and speech recognition

T Herbig, F Gerl, W Minker, R Haeb-Umbach - Evolving Systems, 2011 - Springer

Speech recognition offers an intuitive and convenient interface to control technical devices.
Improvements achieved through ongoing research activities enable the user to handle …

Cited by 5 Related articles All 5 versions

Towards Chapterisation of Podcasts Detection of Host and Structuring Questions in Radio Transcripts

M Piguet - 2024 - infoscience.epfl.ch

This Master thesis investigates the application of Bidirectional Encoder Representations
from Transformers (BERT) on podcast to identify the host and detect structuring questions …

Data-driven audio feature space clustering for automatic sound recognition in radio broadcast news

T Theodorou, I Mporas, A Lazaridis… - International Journal on …, 2017 - World Scientific

Aiming to an automatic sound recognizer for radio broadcasting events, a methodology of
clustering the audio feature space using the discrimination ability of the audio descriptors as …

Cited by 2 Related articles All 10 versions