SpeechBrain: A general-purpose speech toolkit
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …
Speaker recognition from raw waveform with SincNet
M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org
Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …
Light gated recurrent units for speech recognition
M Ravanelli, P Brakel, M Omologo… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
A field that has directly benefited from the recent advances in deep learning is automatic
speech recognition (ASR). Despite the great achievements of the past decades, however, a …
The PyTorch-Kaldi speech recognition toolkit
M Ravanelli, T Parcollet… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …
Interpretable convolutional filters with SincNet
M Ravanelli, Y Bengio - arXiv preprint arXiv:1811.09725, 2018 - arxiv.org
Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations, that …
Learning speaker representations with mutual information
M Ravanelli, Y Bengio - arXiv preprint arXiv:1812.00271, 2018 - arxiv.org
Learning good representations is of crucial importance in deep learning. Mutual Information
(MI) or similar measures of statistical dependence are promising tools for learning these …
Non-parameterized ship maneuvering model of Deep Neural Networks based on real voyage data-driven
Z Wang, J Kim, N Im - Ocean Engineering, 2023 - Elsevier
While Deep Neural Network (DNN) models for ship maneuvering model are
commonly constructed using experimental model ships or simulation data, this study focuses …
Improving speech recognition by revising gated recurrent units
Speech recognition is largely taking advantage of deep learning, showing that substantial
benefits can be obtained by modern Recurrent Neural Networks (RNNs). The most popular …
Autoencoder bottleneck features with multi-task optimisation for improved continuous dysarthric speech recognition
Automatic recognition of dysarthric speech is a very challenging research problem where
performances still lag far behind those achieved for typical speech. The main reason is the …
A network of deep neural networks for distant speech recognition
M Ravanelli, P Brakel, M Omologo… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Despite the remarkable progress recently made in distant speech recognition, state-of-the-
art technology still suffers from a lack of robustness, especially when adverse acoustic …