Decolonising speech and language technology
S Bird - 28th International Conference on Computational …, 2020 - researchers.cdu.edu.au
After generations of exploitation, Indigenous people often respond negatively to the idea that
their languages are data ready for the taking. By treating Indigenous knowledge as a …
their languages are data ready for the taking. By treating Indigenous knowledge as a …
Sequence-to-sequence models can directly translate foreign speech
We present a recurrent encoder-decoder deep neural network architecture that directly
translates speech in one language into text in another. The model does not explicitly …
translates speech in one language into text in another. The model does not explicitly …
ESPnet-ST: All-in-one speech translation toolkit
We present ESPnet-ST, which is designed for the quick development of speech-to-speech
translation systems in a single framework. ESPnet-ST is a new project inside end-to-end …
translation systems in a single framework. ESPnet-ST is a new project inside end-to-end …
Tied multitask learning for neural speech translation
A Anastasopoulos, D Chiang - arXiv preprint arXiv:1802.06655, 2018 - arxiv.org
We explore multitask models for neural translation of speech, augmenting them in order to
reflect two intuitive notions. First, we introduce a model where the second task decoder …
reflect two intuitive notions. First, we introduce a model where the second task decoder …
[PDF][PDF] An attentional model for speech translation without transcription
For many low-resource languages, spoken language resources are more likely to be
annotated with translations than transcriptions. This bilingual speech data can be used for …
annotated with translations than transcriptions. This bilingual speech data can be used for …
Must NLP be Extractive?
S Bird - 62nd Annual Meeting of the Association for …, 2024 - researchers.cdu.edu.au
How do we roll out language technologies across a world with 7,000 languages? In one
story, we scale the successes of NLP further into'low-resource'languages, doing ever more …
story, we scale the successes of NLP further into'low-resource'languages, doing ever more …
Guiding principles for participatory design-inspired natural language processing
We introduce 9 guiding principles 1 to integrate Participatory Design (PD) methods in the
development of Natural Language Processing (NLP) systems. The adoption of PD methods …
development of Natural Language Processing (NLP) systems. The adoption of PD methods …
Parallel speech collection for under-resourced language studies using the LIG-Aikuma mobile device app
D Blachon, E Gauthier, L Besacier, GN Kouarata… - Procedia Computer …, 2016 - Elsevier
This paper reports on our ongoing efforts to collect speech data in under-resourced or
endangered languages of Africa. Data collection is carried out using an improved version of …
endangered languages of Africa. Data collection is carried out using an improved version of …
Automatic interlinear glossing for under-resourced languages leveraging translations
X Zhao, S Ozaki, A Anastasopoulos… - Proceedings of the …, 2020 - aclanthology.org
Abstract Interlinear Glossed Text (IGT) is a widely used format for encoding linguistic
information in language documentation projects and scholarly papers. Manual production of …
information in language documentation projects and scholarly papers. Manual production of …
Hitting the 'pause'button: What does COVID-19 tell us about the future of heritage sounds?
DHR Spennemann, M Parker - Noise Mapping, 2020 - degruyter.com
Human existence is accompanied by environmental sounds as by-products of people's
activities and sounds that are intentionally generated to allow human society to function. The …
activities and sounds that are intentionally generated to allow human society to function. The …