A survey of code-switched speech and language processing
Code-switching, the alternation of languages within a conversation or utterance, is a
common communicative phenomenon that occurs in multilingual communities across the …
common communicative phenomenon that occurs in multilingual communities across the …
A semi-supervised approach to generate the code-mixed text using pre-trained encoder and transfer learning
Code-mixing, the interleaving of two or more languages within a sentence or discourse is
ubiquitous in multilingual societies. The lack of code-mixed training data is one of the major …
ubiquitous in multilingual societies. The lack of code-mixed training data is one of the major …
Lae: Language-aware encoder for monolingual and multilingual asr
Despite the rapid progress in automatic speech recognition (ASR) research, recognizing
multilingual speech using a unified ASR system remains highly challenging. Previous works …
multilingual speech using a unified ASR system remains highly challenging. Previous works …
Calcs 2021 shared task: Machine translation for code-switched data
To date, efforts in the code-switching literature have focused for the most part on language
identification, POS, NER, and syntactic parsing. In this paper, we address machine …
identification, POS, NER, and syntactic parsing. In this paper, we address machine …
Modeling code-switch languages using bilingual parallel corpus
Abstract Language modeling is the technique to estimate the probability of a sequence of
words. A bilingual language model is expected to model the sequential dependency for …
words. A bilingual language model is expected to model the sequential dependency for …
Training data augmentation for code-mixed translation
Abstract Machine translation of user-generated code-mixed inputs to English is of crucial
importance in applications like web search and targeted advertising. We address the …
importance in applications like web search and targeted advertising. We address the …
Can you traducir this? machine translation for code-switched input
Code-Switching (CSW) is a common phenomenon that occurs in multilingual geographic or
social contexts, which raises challenging problems for natural language processing tools …
social contexts, which raises challenging problems for natural language processing tools …
Joint modeling of code-switched and monolingual asr via conditional factorization
Conversational bilingual speech encompasses three types of utterances: two purely
monolingual types and one intra-sententially code-switched type. In this work, we propose a …
monolingual types and one intra-sententially code-switched type. In this work, we propose a …
Towards zero-shot code-switched speech recognition
In this work, we seek to build effective code-switched (CS) automatic speech recognition
systems (ASR) under the zero-shot set-ting where no transcribed CS speech data is …
systems (ASR) under the zero-shot set-ting where no transcribed CS speech data is …
Towards developing a multilingual and code-mixed visual question answering system by knowledge distillation
Pre-trained language-vision models have shown remarkable performance on the visual
question answering (VQA) task. However, most pre-trained models are trained by only …
question answering (VQA) task. However, most pre-trained models are trained by only …