An overview on perceptually motivated audio indexing and classification
G Richard, S Sundaram… - Proceedings of the …, 2013 - ieeexplore.ieee.org
An audio indexing system aims at describing audio content by identifying, labeling, or
categorizing different acoustic events. Since the resulting audio classification and indexing …
categorizing different acoustic events. Since the resulting audio classification and indexing …
Content-based and knowledge-enriched representations for classification across modalities: a survey
This survey documents representation approaches for classification across different
modalities, from purely content-based methods to techniques utilizing external sources of …
modalities, from purely content-based methods to techniques utilizing external sources of …
Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity
This paper presents a vocal timbre analysis method based on topic modeling using latent
Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics …
Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics …
Analysis of engagement behavior in children during dyadic interactions using prosodic cues
Child engagement is defined as the interaction of a child with his/her environment in a
contextually appropriate manner. Engagement behavior in children is linked to socio …
contextually appropriate manner. Engagement behavior in children is linked to socio …
Bayesian nonparametric learning for hierarchical and sparse topics
JT Chien - IEEE/ACM Transactions on Audio, Speech, and …, 2017 - ieeexplore.ieee.org
This paper presents the Bayesian nonparametric (BNP) learning for hierarchical and sparse
topics from natural language. Traditionally, the Indian buffet process provides the BNP prior …
topics from natural language. Traditionally, the Indian buffet process provides the BNP prior …
Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and Insights
P Figuera, P García Bringas - Technologies, 2024 - mdpi.com
This manuscript provides a comprehensive exploration of Probabilistic latent semantic
analysis (PLSA), highlighting its strengths, drawbacks, and challenges. The PLSA, originally …
analysis (PLSA), highlighting its strengths, drawbacks, and challenges. The PLSA, originally …
Audio scene recognition based on audio events and topic model
Y Leng, N Zhou, C Sun, X Xu, Q Yuan, C Cheng… - Knowledge-Based …, 2017 - Elsevier
Topic model is a hot research topic which is attracting attentions from many fields. Recently,
several studies have applied topic model to ASR (audio scene recognition). Among these …
several studies have applied topic model to ASR (audio scene recognition). Among these …
Volcano-seismic events classification using document classification strategies
M Bicego, JM Londoño-Bonilla… - Image Analysis and …, 2015 - Springer
In this paper we propose a novel framework for the classification of volcano-seismic events,
based on strategies and concepts typically employed to classify documents–subsequently …
based on strategies and concepts typically employed to classify documents–subsequently …
On-line genre classification of TV programs using audio content
Automatic genre classification of TV programs can benefit users in various ways such as
allowing for rapid selection of multimedia content. In this paper, we introduce an on-line …
allowing for rapid selection of multimedia content. In this paper, we introduce an on-line …
Hierarchical representation based on Bayesian nonparametric tree-structured mixture model for playing technique classification
This work develops a topic model-based hierarchical representation for identifying the latent
characteristics behind the frame-level musical features. Frame-level features and music clips …
characteristics behind the frame-level musical features. Frame-level features and music clips …