Spoken content retrieval—beyond cascading speech recognition with text retrieval
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …
the audio rather than text descriptions. This potentially eliminates the requirement of …
[PDF][PDF] Parallel inference of dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study.
We adopt a Dirichlet process Gaussian mixture model (DPGMM) for unsupervised acoustic
modeling and represent speech frames with Gaussian posteriorgrams. The model performs …
modeling and represent speech frames with Gaussian posteriorgrams. The model performs …
Learning acoustic word embeddings with temporal context for query-by-example speech search
We propose to learn acoustic word embeddings with temporal context for query-by-example
(QbE) speech search. The temporal context includes the leading and trailing word …
(QbE) speech search. The temporal context includes the leading and trailing word …
[PDF][PDF] Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection.
We propose a framework which ports Dirichlet Gaussian mixture model (DPGMM) based
labels to deep neural network (DNN). The DNN trained using the unsupervised labels is …
labels to deep neural network (DNN). The DNN trained using the unsupervised labels is …
Pairwise learning using multi-lingual bottleneck features for low-resource query-by-example spoken term detection
We propose to use a feature representation obtained by pairwise learning in a low-resource
language for query-by-example spoken term detection (QbE-STD). We assume that word …
language for query-by-example spoken term detection (QbE-STD). We assume that word …
[HTML][HTML] The multi-domain international search on speech 2020 albayzin evaluation: Overview, systems, results, discussion and post-evaluation analyses
The large amount of information stored in audio and video repositories makes search on
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …
Investigating neural network based query-by-example keyword spotting approach for personalized wake-up word detection in Mandarin Chinese
We use query-by-example keyword spotting (QbyE-KWS) approach to solve the
personalized wake-up word detection problem for small-footprint, low-computational cost on …
personalized wake-up word detection problem for small-footprint, low-computational cost on …
Partial matching and search space reduction for QbE-STD
MC Madhavi, HA Patil - Computer Speech & Language, 2017 - Elsevier
Query-by-Example approach of spoken content retrieval has gained much attention because
of its feasibility in the absence of speech recognition and its applicability in a multilingual …
of its feasibility in the absence of speech recognition and its applicability in a multilingual …
[PDF][PDF] The NNI Query-by-Example System for MediaEval 2014.
In this paper we describe the system proposed by NNI (NWPUNTU-I2R) team for the
QUESST task within the Mediaeval 2014 evaluation. To solve the problem, we used both …
QUESST task within the Mediaeval 2014 evaluation. To solve the problem, we used both …
[PDF][PDF] Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis.
This paper documents the significant components of a state-ofthe-art language-independent
query-by-example spoken term detection system designed for the Query by Example Search …
query-by-example spoken term detection system designed for the Query by Example Search …