[PDF][PDF] Montreal forced aligner: Trainable text-speech alignment using kaldi.

M McAuliffe, M Socolof, S Mihuc, M Wagner… - Interspeech, 2017 - isca-archive.org
Abstract We present the Montreal Forced Aligner (MFA), a new opensource system for
speech-text alignment. MFA is an update to the Prosodylab-Aligner, and maintains its key …

Whisperx: Time-accurate speech transcription of long-form audio

M Bain, J Huh, T Han, A Zisserman - arXiv preprint arXiv:2303.00747, 2023 - arxiv.org
Large-scale, weakly-supervised speech recognition models, such as Whisper, have
demonstrated impressive results on speech recognition across domains and languages …

AR-mentor: Augmented reality based mentoring system

Z Zhu, V Branzoi, M Wolverton, G Murray… - … symposium on mixed …, 2014 - ieeexplore.ieee.org
AR-Mentor is a wearable real time Augmented Reality (AR) mentoring system that is
configured to assist in maintenance and repair tasks of complex machinery, such as …

Highly accurate phonetic segmentation using boundary correction models and system fusion

A Stolcke, N Ryant, V Mitra, J Yuan… - … , Speech and Signal …, 2014 - ieeexplore.ieee.org
Accurate phone-level segmentation of speech remains an important task for many subfields
of speech research. We investigate techniques for boosting the accuracy of automatic …

Forced alignment for Nordic languages: Rapidly constructing a high-quality prototype

NJ Young, M McGarrah - Nordic Journal of Linguistics, 2023 - cambridge.org
We propose a rapid adaptation of FAVE-Align to the Nordic languages, and we offer our own
adaptation to Swedish as a template. This study is motivated by the fact that researchers of …

Automated analysis of natural speech in amyotrophic lateral sclerosis spectrum disorders

N Nevler, S Ash, C McMillan, L Elman, L McCluskey… - Neurology, 2020 - AAN Enterprises
Objective We implemented automated methods to analyze speech and evaluate the
hypothesis that cognitive and motor factors impair prosody in partially distinct ways in …

[HTML][HTML] Exploring autism spectrum disorders using HLT

J Parish-Morris, M Liberman, N Ryant… - Proceedings of the …, 2016 - ncbi.nlm.nih.gov
The phenotypic complexity of Autism Spectrum Disorder motivates the application of modern
computational methods to large collections of observational data, both for improved clinical …

Automatic detection of sociolinguistic variation using forced alignment

G Bailey - 2016 - repository.upenn.edu
Forced alignment software is now widely used in contemporary sociolinguistics, and is
quickly becoming a crucial methodological tool as an increasing number of studies begin to …

Mandarin tone classification without pitch tracking

N Ryant, J Yuan, M Liberman - 2014 IEEE international …, 2014 - ieeexplore.ieee.org
A deep neural network (DNN) based classifier achieved 27.38% frame error rate (FER) and
15.62% segment error rate (SER) in recognizing five tonal categories in Mandarin Chinese …

Comparison of two forced alignments systems for aligning bribri speech

SF Solórzano, R Coto-Solano - CLEI Electronic Journal, 2017 - clei.org
Forced alignment provides drastic savings in time when aligning speech recordings and is
particularly useful for the study of Indigenous languages, which are severely under …