SpeechBrain: A general-purpose speech toolkit

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arXiv preprint arXiv …, 2021 - arxiv.org
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

Speaker recognition from raw waveform with SincNet

M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org
Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …

Light gated recurrent units for speech recognition

M Ravanelli, P Brakel, M Omologo… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
A field that has directly benefited from the recent advances in deep learning is automatic
speech recognition (ASR). Despite the great achievements of the past decades, however, a …

The PyTorch-Kaldi speech recognition toolkit

M Ravanelli, T Parcollet… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …

Interpretable convolutional filters with SincNet

M Ravanelli, Y Bengio - arXiv preprint arXiv:1811.09725, 2018 - arxiv.org
Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations, that …

Learning speaker representations with mutual information

M Ravanelli, Y Bengio - arXiv preprint arXiv:1812.00271, 2018 - arxiv.org
Learning good representations is of crucial importance in deep learning. Mutual Information
(MI) or similar measures of statistical dependence are promising tools for learning these …

Non-parameterized ship maneuvering model of Deep Neural Networks based on real voyage data-driven

Z Wang, J Kim, N Im - Ocean Engineering, 2023 - Elsevier
While Deep Neural Network (DNN) models for ship maneuvering model are
commonly constructed using experimental model ships or simulation data, this study focuses …

Improving speech recognition by revising gated recurrent units

M Ravanelli, P Brakel, M Omologo, Y Bengio - arXiv preprint arXiv …, 2017 - arxiv.org
Speech recognition is largely taking advantage of deep learning, showing that substantial
benefits can be obtained by modern Recurrent Neural Networks (RNNs). The most popular …

Autoencoder bottleneck features with multi-task optimisation for improved continuous dysarthric speech recognition

Z Yue, H Christensen, J Barker - Proceedings of Interspeech …, 2020 - eprints.whiterose.ac.uk
Automatic recognition of dysarthric speech is a very challenging research problem where
performances still lag far behind those achieved for typical speech. The main reason is the …

A network of deep neural networks for distant speech recognition

M Ravanelli, P Brakel, M Omologo… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art
technology still suffers from a lack of robustness, especially when adverse acoustic …