A survey of code-switched speech and language processing

S Sitaram, KR Chandu, SK Rallabandi… - arXiv preprint arXiv …, 2019 - arxiv.org
Code-switching, the alternation of languages within a conversation or utterance, is a
common communicative phenomenon that occurs in multilingual communities across the …

A semi-supervised approach to generate the code-mixed text using pre-trained encoder and transfer learning

D Gupta, A Ekbal, P Bhattacharyya - Findings of the Association …, 2020 - aclanthology.org
Code-mixing, the interleaving of two or more languages within a sentence or discourse is
ubiquitous in multilingual societies. The lack of code-mixed training data is one of the major …

Lae: Language-aware encoder for monolingual and multilingual asr

J Tian, J Yu, C Zhang, C Weng, Y Zou, D Yu - arXiv preprint arXiv …, 2022 - arxiv.org
Despite the rapid progress in automatic speech recognition (ASR) research, recognizing
multilingual speech using a unified ASR system remains highly challenging. Previous works …

Calcs 2021 shared task: Machine translation for code-switched data

S Chen, G Aguilar, A Srinivasan, M Diab… - arXiv preprint arXiv …, 2022 - arxiv.org
To date, efforts in the code-switching literature have focused for the most part on language
identification, POS, NER, and syntactic parsing. In this paper, we address machine …

Modeling code-switch languages using bilingual parallel corpus

G Lee, H Li - Proceedings of the 58th Annual Meeting of the …, 2020 - aclanthology.org
Abstract Language modeling is the technique to estimate the probability of a sequence of
words. A bilingual language model is expected to model the sequential dependency for …

Training data augmentation for code-mixed translation

A Gupta, A Vavre, S Sarawagi - … of the 2021 Conference of the …, 2021 - aclanthology.org
Abstract Machine translation of user-generated code-mixed inputs to English is of crucial
importance in applications like web search and targeted advertising. We address the …

Can you traducir this? machine translation for code-switched input

J Xu, F Yvon - arXiv preprint arXiv:2105.04846, 2021 - arxiv.org
Code-Switching (CSW) is a common phenomenon that occurs in multilingual geographic or
social contexts, which raises challenging problems for natural language processing tools …

Joint modeling of code-switched and monolingual asr via conditional factorization

B Yan, C Zhang, M Yu, SX Zhang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Conversational bilingual speech encompasses three types of utterances: two purely
monolingual types and one intra-sententially code-switched type. In this work, we propose a …

Towards zero-shot code-switched speech recognition

B Yan, M Wiesner, O Klejch, P Jyothi… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this work, we seek to build effective code-switched (CS) automatic speech recognition
systems (ASR) under the zero-shot set-ting where no transcribed CS speech data is …

Towards developing a multilingual and code-mixed visual question answering system by knowledge distillation

HR Khan, D Gupta, A Ekbal - arXiv preprint arXiv:2109.04653, 2021 - arxiv.org
Pre-trained language-vision models have shown remarkable performance on the visual
question answering (VQA) task. However, most pre-trained models are trained by only …