[PDF][PDF] Montreal forced aligner: Trainable text-speech alignment using kaldi.
M McAuliffe, M Socolof, S Mihuc, M Wagner… - Interspeech, 2017 - isca-archive.org
Abstract We present the Montreal Forced Aligner (MFA), a new opensource system for
speech-text alignment. MFA is an update to the Prosodylab-Aligner, and maintains its key …
speech-text alignment. MFA is an update to the Prosodylab-Aligner, and maintains its key …
Whisperx: Time-accurate speech transcription of long-form audio
Large-scale, weakly-supervised speech recognition models, such as Whisper, have
demonstrated impressive results on speech recognition across domains and languages …
demonstrated impressive results on speech recognition across domains and languages …
AR-mentor: Augmented reality based mentoring system
Z Zhu, V Branzoi, M Wolverton, G Murray… - … symposium on mixed …, 2014 - ieeexplore.ieee.org
AR-Mentor is a wearable real time Augmented Reality (AR) mentoring system that is
configured to assist in maintenance and repair tasks of complex machinery, such as …
configured to assist in maintenance and repair tasks of complex machinery, such as …
Highly accurate phonetic segmentation using boundary correction models and system fusion
Accurate phone-level segmentation of speech remains an important task for many subfields
of speech research. We investigate techniques for boosting the accuracy of automatic …
of speech research. We investigate techniques for boosting the accuracy of automatic …
Forced alignment for Nordic languages: Rapidly constructing a high-quality prototype
NJ Young, M McGarrah - Nordic Journal of Linguistics, 2023 - cambridge.org
We propose a rapid adaptation of FAVE-Align to the Nordic languages, and we offer our own
adaptation to Swedish as a template. This study is motivated by the fact that researchers of …
adaptation to Swedish as a template. This study is motivated by the fact that researchers of …
Automated analysis of natural speech in amyotrophic lateral sclerosis spectrum disorders
N Nevler, S Ash, C McMillan, L Elman, L McCluskey… - Neurology, 2020 - AAN Enterprises
Objective We implemented automated methods to analyze speech and evaluate the
hypothesis that cognitive and motor factors impair prosody in partially distinct ways in …
hypothesis that cognitive and motor factors impair prosody in partially distinct ways in …
[HTML][HTML] Exploring autism spectrum disorders using HLT
The phenotypic complexity of Autism Spectrum Disorder motivates the application of modern
computational methods to large collections of observational data, both for improved clinical …
computational methods to large collections of observational data, both for improved clinical …
Automatic detection of sociolinguistic variation using forced alignment
G Bailey - 2016 - repository.upenn.edu
Forced alignment software is now widely used in contemporary sociolinguistics, and is
quickly becoming a crucial methodological tool as an increasing number of studies begin to …
quickly becoming a crucial methodological tool as an increasing number of studies begin to …
Mandarin tone classification without pitch tracking
A deep neural network (DNN) based classifier achieved 27.38% frame error rate (FER) and
15.62% segment error rate (SER) in recognizing five tonal categories in Mandarin Chinese …
15.62% segment error rate (SER) in recognizing five tonal categories in Mandarin Chinese …
Comparison of two forced alignments systems for aligning bribri speech
SF Solórzano, R Coto-Solano - CLEI Electronic Journal, 2017 - clei.org
Forced alignment provides drastic savings in time when aligning speech recordings and is
particularly useful for the study of Indigenous languages, which are severely under …
particularly useful for the study of Indigenous languages, which are severely under …