An overview on perceptually motivated audio indexing and classification

G Richard, S Sundaram… - Proceedings of the …, 2013 - ieeexplore.ieee.org
An audio indexing system aims at describing audio content by identifying, labeling, or
categorizing different acoustic events. Since the resulting audio classification and indexing …

Content-based and knowledge-enriched representations for classification across modalities: a survey

N Pittaras, G Giannakopoulos, P Stamatopoulos… - ACM Computing …, 2023 - dl.acm.org
This survey documents representation approaches for classification across different
modalities, from purely content-based methods to techniques utilizing external sources of …

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity

T Nakano, K Yoshii, M Goto - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
This paper presents a vocal timbre analysis method based on topic modeling using latent
Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics …

Analysis of engagement behavior in children during dyadic interactions using prosodic cues

R Gupta, D Bone, S Lee, S Narayanan - Computer speech & language, 2016 - Elsevier
Child engagement is defined as the interaction of a child with his/her environment in a
contextually appropriate manner. Engagement behavior in children is linked to socio …

Bayesian nonparametric learning for hierarchical and sparse topics

JT Chien - IEEE/ACM Transactions on Audio, Speech, and …, 2017 - ieeexplore.ieee.org
This paper presents the Bayesian nonparametric (BNP) learning for hierarchical and sparse
topics from natural language. Traditionally, the Indian buffet process provides the BNP prior …

Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and Insights

P Figuera, P García Bringas - Technologies, 2024 - mdpi.com
This manuscript provides a comprehensive exploration of Probabilistic latent semantic
analysis (PLSA), highlighting its strengths, drawbacks, and challenges. The PLSA, originally …

Audio scene recognition based on audio events and topic model

Y Leng, N Zhou, C Sun, X Xu, Q Yuan, C Cheng… - Knowledge-Based …, 2017 - Elsevier
Topic model is a hot research topic which is attracting attentions from many fields. Recently,
several studies have applied topic model to ASR (audio scene recognition). Among these …

Volcano-seismic events classification using document classification strategies

M Bicego, JM Londoño-Bonilla… - Image Analysis and …, 2015 - Springer
In this paper we propose a novel framework for the classification of volcano-seismic events,
based on strategies and concepts typically employed to classify documents–subsequently …

On-line genre classification of TV programs using audio content

S Kim, P Georgiou, S Narayanan - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Automatic genre classification of TV programs can benefit users in various ways such as
allowing for rapid selection of multimedia content. In this paper, we introduce an on-line …

Hierarchical representation based on Bayesian nonparametric tree-structured mixture model for playing technique classification

SH Chen, SH Wu, YS Lee, R Lo, JC Wang - Proceedings of the on …, 2017 - dl.acm.org
This work develops a topic model-based hierarchical representation for identifying the latent
characteristics behind the frame-level musical features. Frame-level features and music clips …