Decolonising speech and language technology

S Bird - 28th International Conference on Computational …, 2020 - researchers.cdu.edu.au
After generations of exploitation, Indigenous people often respond negatively to the idea that
their languages are data ready for the taking. By treating Indigenous knowledge as a …

Sequence-to-sequence models can directly translate foreign speech

RJ Weiss, J Chorowski, N Jaitly, Y Wu… - arXiv preprint arXiv …, 2017 - arxiv.org
We present a recurrent encoder-decoder deep neural network architecture that directly
translates speech in one language into text in another. The model does not explicitly …

ESPnet-ST: All-in-one speech translation toolkit

H Inaguma, S Kiyono, K Duh, S Karita… - arXiv preprint arXiv …, 2020 - arxiv.org
We present ESPnet-ST, which is designed for the quick development of speech-to-speech
translation systems in a single framework. ESPnet-ST is a new project inside end-to-end …

Tied multitask learning for neural speech translation

A Anastasopoulos, D Chiang - arXiv preprint arXiv:1802.06655, 2018 - arxiv.org
We explore multitask models for neural translation of speech, augmenting them in order to
reflect two intuitive notions. First, we introduce a model where the second task decoder …

[PDF][PDF] An attentional model for speech translation without transcription

L Duong, A Anastasopoulos, D Chiang… - Proceedings of the …, 2016 - aclanthology.org
For many low-resource languages, spoken language resources are more likely to be
annotated with translations than transcriptions. This bilingual speech data can be used for …

Must NLP be Extractive?

S Bird - 62nd Annual Meeting of the Association for …, 2024 - researchers.cdu.edu.au
How do we roll out language technologies across a world with 7,000 languages? In one
story, we scale the successes of NLP further into'low-resource'languages, doing ever more …

Guiding principles for participatory design-inspired natural language processing

T Caselli, R Cibin, C Conforti, E Encinas… - Proceedings of the 1st …, 2021 - research.rug.nl
We introduce 9 guiding principles 1 to integrate Participatory Design (PD) methods in the
development of Natural Language Processing (NLP) systems. The adoption of PD methods …

Parallel speech collection for under-resourced language studies using the LIG-Aikuma mobile device app

D Blachon, E Gauthier, L Besacier, GN Kouarata… - Procedia Computer …, 2016 - Elsevier
This paper reports on our ongoing efforts to collect speech data in under-resourced or
endangered languages of Africa. Data collection is carried out using an improved version of …

Automatic interlinear glossing for under-resourced languages leveraging translations

X Zhao, S Ozaki, A Anastasopoulos… - Proceedings of the …, 2020 - aclanthology.org
Abstract Interlinear Glossed Text (IGT) is a widely used format for encoding linguistic
information in language documentation projects and scholarly papers. Manual production of …

Hitting the 'pause'button: What does COVID-19 tell us about the future of heritage sounds?

DHR Spennemann, M Parker - Noise Mapping, 2020 - degruyter.com
Human existence is accompanied by environmental sounds as by-products of people's
activities and sounds that are intentionally generated to allow human society to function. The …