SpeechBrain: A general-purpose speech toolkit

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arXiv preprint arXiv …, 2021 - arxiv.org

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

Cited by 580; 5 versions

A fine-tuned wav2vec 2.0/HuBERT benchmark for speech emotion recognition, speaker verification and spoken language understanding

Y Wang, A Boumadane, A Heba - arXiv preprint arXiv:2111.02735, 2021 - arxiv.org

Speech self-supervised models such as wav2vec 2.0 and HuBERT are making revolutionary
progress in Automatic Speech Recognition (ASR). However, they have not been totally …

Cited by 138; 3 versions

SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks

S Shon, S Arora, CJ Lin, A Pasad, F Wu… - arXiv preprint arXiv …, 2022 - arxiv.org

Spoken language understanding (SLU) tasks have been studied for many decades in the
speech research community, but have not received as much attention as lower-level tasks …

Cited by 24; 7 versions

Match to win: Analysing sequences lengths for efficient self-supervised learning in speech and audio

Y Gao, J Fernandez-Marques… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

Self-supervised learning (SSL) has proven vital in speech and audio-related applications.
The paradigm trains a general model on unlabeled data that can later be used to solve …

Cited by 8; 4 versions

Finstreder: Simple and fast spoken language understanding with finite state transducers using modern speech-to-text models

D Bermuth, A Poeppel, W Reif - arXiv preprint arXiv:2206.14589, 2022 - arxiv.org

In Spoken Language Understanding (SLU) the task is to extract important information from
audio commands, like the intent of what a user wants the system to do and special entities …

Cited by 6; 5 versions

TARIC-SLU: A Tunisian Benchmark Dataset For Spoken Language Understanding

S Mdhaffar, F Bougares, R De Mori… - Proceedings of the …, 2024 - aclanthology.org

In recent years, there has been a significant increase in interest in developing Spoken
Language Understanding (SLU) systems. SLU involves extracting a list of semantic …

Cited by 1
