[PDF][PDF] Montreal forced aligner: Trainable text-speech alignment using kaldi.

M McAuliffe, M Socolof, S Mihuc, M Wagner… - Interspeech, 2017 - isca-archive.org
Abstract We present the Montreal Forced Aligner (MFA), a new opensource system for
speech-text alignment. MFA is an update to the Prosodylab-Aligner, and maintains its key …

Multilingual processing of speech via web services

T Kisler, U Reichel, F Schiel - Computer Speech & Language, 2017 - Elsevier
A new software paradigmSoftware as a Service'based on web services is proposed for
multilingual linguistic tools and exemplified with the BAS CLARIN web services. Instead of …

[HTML][HTML] Electrophysiological correlates of semantic dissimilarity reflect the comprehension of natural, narrative speech

MP Broderick, AJ Anderson, GM Di Liberto, MJ Crosse… - Current Biology, 2018 - cell.com
People routinely hear and understand speech at rates of 120–200 words per minute [1, 2].
Thus, speech comprehension must involve rapid, online neural mechanisms that process …

Low-frequency cortical entrainment to speech reflects phoneme-level processing

GM Di Liberto, JA O'sullivan, EC Lalor - Current Biology, 2015 - cell.com
The human ability to understand speech is underpinned by a hierarchical auditory system
whose successive stages process increasingly complex attributes of the acoustic input. It …

Using a smartwatch and smartphone to assess early Parkinson's disease in the WATCH-PD study

JL Adams, T Kangarloo, B Tracey, P O'Donnell… - npj Parkinson's …, 2023 - nature.com
Digital health technologies can provide continuous monitoring and objective, real-world
measures of Parkinson's disease (PD), but have primarily been evaluated in small, single …

[HTML][HTML] Two distinct neural timescales for predictive speech processing

PW Donhauser, S Baillet - Neuron, 2020 - cell.com
During speech listening, the brain could use contextual predictions to optimize sensory
sampling and processing. We asked if such predictive processing is organized dynamically …

Augmented datasheets for speech datasets and ethical decision-making

O Papakyriakopoulos, ASG Choi, W Thong… - Proceedings of the …, 2023 - dl.acm.org
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …

Joint, distributed and hierarchically organized encoding of linguistic features in the human auditory cortex

M Keshishian, S Akkol, J Herrero, S Bickel… - Nature human …, 2023 - nature.com
The precise role of the human auditory cortex in representing speech sounds and
transforming them to meaning is not yet fully understood. Here we used intracranial …

Atypical cortical entrainment to speech in the right hemisphere underpins phonemic deficits in dyslexia

GM Di Liberto, V Peter, M Kalashnikova, U Goswami… - NeuroImage, 2018 - Elsevier
Developmental dyslexia is a multifaceted disorder of learning primarily manifested by
difficulties in reading, spelling, and phonological processing. Neural studies suggest that …

Cortical tracking of surprisal during continuous speech comprehension

H Weissbart, KD Kandylaki… - Journal of cognitive …, 2020 - direct.mit.edu
Speech comprehension requires rapid online processing of a continuous acoustic signal to
extract structure and meaning. Previous studies on sentence comprehension have found …