Batch-normalized joint training for DNN-based distant speech recognition

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arXiv preprint arXiv …, 2021 - arxiv.org

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

被引用次数：725 相关文章所有 5 个版本

[PDF] researchgate.net

Speaker recognition from raw waveform with sincnet

M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org

Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …

被引用次数：997 相关文章所有 10 个版本

[PDF] arxiv.org

Light gated recurrent units for speech recognition

M Ravanelli, P Brakel, M Omologo… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

A field that has directly benefited from the recent advances in deep learning is automatic
speech recognition (ASR). Despite the great achievements of the past decades, however, a …

被引用次数：443 相关文章所有 7 个版本

[PDF] arxiv.org

The pytorch-kaldi speech recognition toolkit

M Ravanelli, T Parcollet… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …

被引用次数：297 相关文章所有 9 个版本

[PDF] researchgate.net

Interpretable convolutional filters with sincnet

M Ravanelli, Y Bengio - arXiv preprint arXiv:1811.09725, 2018 - arxiv.org

Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations, that …

被引用次数：149 相关文章所有 5 个版本

[PDF] arxiv.org

Learning speaker representations with mutual information

M Ravanelli, Y Bengio - arXiv preprint arXiv:1812.00271, 2018 - arxiv.org

Learning good representations is of crucial importance in deep learning. Mutual Information
(MI) or similar measures of statistical dependence are promising tools for learning these …

被引用次数：107 相关文章所有 9 个版本

Non-parameterized ship maneuvering model of Deep Neural Networks based on real voyage data-driven

Z Wang, J Kim, N Im - Ocean Engineering, 2023 - Elsevier

Abstract While Deep Neural Network (DNN) models for ship maneuvering model are
commonly constructed using experimental model ships or simulation data, this study focuses …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Improving speech recognition by revising gated recurrent units

M Ravanelli, P Brakel, M Omologo, Y Bengio - arXiv preprint arXiv …, 2017 - arxiv.org

Speech recognition is largely taking advantage of deep learning, showing that substantial
benefits can be obtained by modern Recurrent Neural Networks (RNNs). The most popular …

被引用次数：71 相关文章所有 11 个版本

[PDF] whiterose.ac.uk

Autoencoder bottleneck features with multi-task optimisation for improved continuous dysarthric speech recognition

Z Yue, H Christensen, J Barker - Proceedings of Interspeech …, 2020 - eprints.whiterose.ac.uk

Automatic recognition of dysarthric speech is a very challenging research problem where
performances still lag far behind those achieved for typical speech. The main reason is the …

被引用次数：29 相关文章所有 8 个版本

[PDF] arxiv.org

A network of deep neural networks for distant speech recognition

M Ravanelli, P Brakel, M Omologo… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org

Despite the remarkable progress recently made in distant speech recognition, state-of-the-
art technology still suffers from a lack of robustness, especially when adverse acoustic …

被引用次数：54 相关文章所有 8 个版本