Findings of the IWSLT 2022 Evaluation Campaign.
The evaluation campaign of the 19th International Conference on Spoken Language
Translation featured eight shared tasks:(i) Simultaneous speech translation,(ii) Offline …
Translation featured eight shared tasks:(i) Simultaneous speech translation,(ii) Offline …
The multilingual tedx corpus for speech recognition and translation
We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and
speech translation (ST) research across many non-English source languages. The corpus is …
speech translation (ST) research across many non-English source languages. The corpus is …
Understanding the brain with attention: A survey of transformers in brain sciences
Owing to their superior capabilities and advanced achievements, Transformers have
gradually attracted attention with regard to understanding complex brain processing …
gradually attracted attention with regard to understanding complex brain processing …
Adaptive multilingual speech recognition with pretrained models
Multilingual speech recognition with supervised learning has achieved great results as
reflected in recent research. With the development of pretraining methods on audio and text …
reflected in recent research. With the development of pretraining methods on audio and text …
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
This paper proposes a novel direct Audio-Visual Speech to Audio-Visual Speech
Translation (AV2AV) framework where the input and output of the system are multimodal (ie …
Translation (AV2AV) framework where the input and output of the system are multimodal (ie …
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers
In this paper we provide, to the best of our knowledge, the first comprehensive approach for
incorporating various masking mechanisms into Transformers architectures in a scalable …
incorporating various masking mechanisms into Transformers architectures in a scalable …
Incorporating relative position information in transformer-based sign language recognition and translation
Recent advancements in machine translation tasks, with the advent of attention mechanisms
and Transformer networks, have accelerated the research in Sign Language Translation …
and Transformer networks, have accelerated the research in Sign Language Translation …
[HTML][HTML] A reverse positional encoding multi-head attention-based neural machine translation model for arabic dialects
Languages with a grammatical structure that have a free order for words, such as Arabic
dialects, are considered a challenge for neural machine translation (NMT) models because …
dialects, are considered a challenge for neural machine translation (NMT) models because …
ESPnet-ST IWSLT 2021 offline speech translation system
This paper describes the ESPnet-ST group's IWSLT 2021 submission in the offline speech
translation track. This year we made various efforts on training data, architecture, and audio …
translation track. This year we made various efforts on training data, architecture, and audio …
Variable attention masking for configurable transformer transducer speech recognition
This work studies the use of attention masking in transformer transducer based speech
recognition for building a single configurable model for different deployment scenarios. We …
recognition for building a single configurable model for different deployment scenarios. We …