Latent acoustic topic models for unstructured audio classification

G Richard, S Sundaram… - Proceedings of the …, 2013 - ieeexplore.ieee.org

An audio indexing system aims at describing audio content by identifying, labeling, or
categorizing different acoustic events. Since the resulting audio classification and indexing …

被引用次数：65 相关文章所有 7 个版本

[PDF] acm.org

Content-based and knowledge-enriched representations for classification across modalities: a survey

N Pittaras, G Giannakopoulos, P Stamatopoulos… - ACM Computing …, 2023 - dl.acm.org

This survey documents representation approaches for classification across different
modalities, from purely content-based methods to techniques utilizing external sources of …

被引用次数：2 相关文章所有 3 个版本

[PDF] aist.go.jp

Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity

T Nakano, K Yoshii, M Goto - 2014 IEEE International …, 2014 - ieeexplore.ieee.org

This paper presents a vocal timbre analysis method based on topic modeling using latent
Dirichlet allocation (LDA). Although many works have focused on analyzing characteristics …

被引用次数：33 相关文章所有 10 个版本

[PDF] sciencedirect.com

Analysis of engagement behavior in children during dyadic interactions using prosodic cues

R Gupta, D Bone, S Lee, S Narayanan - Computer speech & language, 2016 - Elsevier

Child engagement is defined as the interaction of a child with his/her environment in a
contextually appropriate manner. Engagement behavior in children is linked to socio …

被引用次数：28 相关文章所有 9 个版本

Bayesian nonparametric learning for hierarchical and sparse topics

JT Chien - IEEE/ACM Transactions on Audio, Speech, and …, 2017 - ieeexplore.ieee.org

This paper presents the Bayesian nonparametric (BNP) learning for hierarchical and sparse
topics from natural language. Traditionally, the Indian buffet process provides the BNP prior …

被引用次数：14 相关文章所有 3 个版本

[PDF] mdpi.com

Revisiting Probabilistic Latent Semantic Analysis: Extensions, Challenges and Insights

P Figuera, P García Bringas - Technologies, 2024 - mdpi.com

This manuscript provides a comprehensive exploration of Probabilistic latent semantic
analysis (PLSA), highlighting its strengths, drawbacks, and challenges. The PLSA, originally …

Audio scene recognition based on audio events and topic model

Y Leng, N Zhou, C Sun, X Xu, Q Yuan, C Cheng… - Knowledge-Based …, 2017 - Elsevier

Topic model is a hot research topic which is attracting attentions from many fields. Recently,
several studies have applied topic model to ASR (audio scene recognition). Among these …

被引用次数：11 相关文章所有 3 个版本

[PDF] univr.it

Volcano-seismic events classification using document classification strategies

M Bicego, JM Londoño-Bonilla… - Image Analysis and …, 2015 - Springer

In this paper we propose a novel framework for the classification of volcano-seismic events,
based on strategies and concepts typically employed to classify documents–subsequently …

被引用次数：11 相关文章所有 7 个版本

[PDF] usc.edu

On-line genre classification of TV programs using audio content

S Kim, P Georgiou, S Narayanan - 2013 IEEE International …, 2013 - ieeexplore.ieee.org

Automatic genre classification of TV programs can benefit users in various ways such as
allowing for rapid selection of multimedia content. In this paper, we introduce an on-line …

被引用次数：14 相关文章所有 5 个版本

Hierarchical representation based on Bayesian nonparametric tree-structured mixture model for playing technique classification

SH Chen, SH Wu, YS Lee, R Lo, JC Wang - Proceedings of the on …, 2017 - dl.acm.org

This work develops a topic model-based hierarchical representation for identifying the latent
characteristics behind the frame-level musical features. Frame-level features and music clips …

被引用次数：4 相关文章