Spoken content retrieval—beyond cascading speech recognition with text retrieval

L Lee, J Glass, H Lee, C Chan - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …

Parallel inference of Dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study

H Chen, CC Leung, L Xie, B Ma, H Li - INTERSPEECH, 2015 - isca-archive.org
We adopt a Dirichlet process Gaussian mixture model (DPGMM) for unsupervised acoustic
modeling and represent speech frames with Gaussian posteriorgrams. The model performs …

Learning acoustic word embeddings with temporal context for query-by-example speech search

Y Yuan, CC Leung, L Xie, H Chen, B Ma… - arXiv preprint arXiv …, 2018 - arxiv.org
We propose to learn acoustic word embeddings with temporal context for query-by-example
(QbE) speech search. The temporal context includes the leading and trailing word …

Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection

H Chen, CC Leung, L Xie, B Ma, H Li - Interspeech, 2016 - researchgate.net
We propose a framework which ports Dirichlet process Gaussian mixture model (DPGMM) based
labels to deep neural network (DNN). The DNN trained using the unsupervised labels is …

Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection

Y Yuan, CC Leung, L Xie, H Chen… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
We propose to use a feature representation obtained by pairwise learning in a low-resource
language for query-by-example spoken term detection (QbE-STD). We assume that word …

The multi-domain international search on speech 2020 Albayzin evaluation: Overview, systems, results, discussion and post-evaluation analyses

J Tejedor, DT Toledano, JM Ramirez, AR Montalvo… - Applied Sciences, 2021 - mdpi.com
The large amount of information stored in audio and video repositories makes search on
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …

Investigating neural network based query-by-example keyword spotting approach for personalized wake-up word detection in Mandarin Chinese

J Hou, L Xie, Z Fu - 2016 10th international symposium on …, 2016 - ieeexplore.ieee.org
We use query-by-example keyword spotting (QbyE-KWS) approach to solve the
personalized wake-up word detection problem for small-footprint, low-computational cost on …

Partial matching and search space reduction for QbE-STD

MC Madhavi, HA Patil - Computer Speech & Language, 2017 - Elsevier
Query-by-Example approach of spoken content retrieval has gained much attention because
of its feasibility in the absence of speech recognition and its applicability in a multilingual …

The NNI Query-by-Example System for MediaEval 2014

P Yang, H Xu, X Xiao, L Xie, CC Leung, H Chen, J Yu… - MediaEval, 2014 - ceur-ws.org
In this paper we describe the system proposed by NNI (NWPU-NTU-I2R) team for the
QUESST task within the MediaEval 2014 evaluation. To solve the problem, we used both …

Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis

CC Leung, L Wang, H Xu, J Hou, Van Tung Pham… - …, 2016 - researchgate.net
This paper documents the significant components of a state-of-the-art language-independent
query-by-example spoken term detection system designed for the Query by Example Search …