Montreal forced aligner: Trainable text-speech alignment using kaldi.
M McAuliffe, M Socolof, S Mihuc, M Wagner… - Interspeech, 2017 - isca-archive.org
We present the Montreal Forced Aligner (MFA), a new open-source system for
speech-text alignment. MFA is an update to the Prosodylab-Aligner, and maintains its key …
Multilingual processing of speech via web services
A new software paradigm 'Software as a Service' based on web services is proposed for
multilingual linguistic tools and exemplified with the BAS CLARIN web services. Instead of …
Electrophysiological correlates of semantic dissimilarity reflect the comprehension of natural, narrative speech
People routinely hear and understand speech at rates of 120–200 words per minute [1, 2].
Thus, speech comprehension must involve rapid, online neural mechanisms that process …
Low-frequency cortical entrainment to speech reflects phoneme-level processing
The human ability to understand speech is underpinned by a hierarchical auditory system
whose successive stages process increasingly complex attributes of the acoustic input. It …
Using a smartwatch and smartphone to assess early Parkinson's disease in the WATCH-PD study
Digital health technologies can provide continuous monitoring and objective, real-world
measures of Parkinson's disease (PD), but have primarily been evaluated in small, single …
Two distinct neural timescales for predictive speech processing
PW Donhauser, S Baillet - Neuron, 2020 - cell.com
During speech listening, the brain could use contextual predictions to optimize sensory
sampling and processing. We asked if such predictive processing is organized dynamically …
Augmented datasheets for speech datasets and ethical decision-making
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …
Joint, distributed and hierarchically organized encoding of linguistic features in the human auditory cortex
The precise role of the human auditory cortex in representing speech sounds and
transforming them to meaning is not yet fully understood. Here we used intracranial …
Atypical cortical entrainment to speech in the right hemisphere underpins phonemic deficits in dyslexia
Developmental dyslexia is a multifaceted disorder of learning primarily manifested by
difficulties in reading, spelling, and phonological processing. Neural studies suggest that …
Cortical tracking of surprisal during continuous speech comprehension
H Weissbart, KD Kandylaki… - Journal of cognitive …, 2020 - direct.mit.edu
Speech comprehension requires rapid online processing of a continuous acoustic signal to
extract structure and meaning. Previous studies on sentence comprehension have found …