Language modeling for code-mixing: The role of linguistic theory based synthetic data A Pratapa, G Bhat, M Choudhury, S Sitaram, S Dandapat, K Bali Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 150 | 2018 |
Mega: Multilingual evaluation of generative ai K Ahuja, H Diddee, R Hada, M Ochieng, K Ramesh, P Jain, A Nambi, ... arXiv preprint arXiv:2303.12528, 2023 | 124 | 2023 |
A survey of code-switched speech and language processing S Sitaram, KR Chandu, SK Rallabandi, AW Black arXiv preprint arXiv:1904.00784, 2019 | 124 | 2019 |
GLUECoS: An evaluation benchmark for code-switched NLP S Khanuja, S Dandapat, A Srinivasan, S Sitaram, M Choudhury arXiv preprint arXiv:2004.12376, 2020 | 112 | 2020 |
Multilingual and code-switching ASR challenges for low resource Indian languages A Diwan, R Vaideeswaran, S Shah, A Singh, S Raghavan, S Khare, ... arXiv preprint arXiv:2104.00235, 2021 | 79 | 2021 |
Interspeech 2018 Low Resource Automatic Speech Recognition Challenge for Indian Languages BML Srivastava, S Sitaram, RK Mehta, KD Mohan, P Matani, S Satpal, ... Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under …, 2018 | 71 | 2018 |
Word embeddings for code-mixed language processing A Pratapa, M Choudhury, S Sitaram Proceedings of the 2018 conference on empirical methods in natural language …, 2018 | 69 | 2018 |
Polyglot neural language models: A case study in cross-lingual phonetic representation learning Y Tsvetkov, S Sitaram, M Faruqui, G Lample, P Littell, D Mortensen, ... arXiv preprint arXiv:1605.03832, 2016 | 67 | 2016 |
A survey of code-switching: Linguistic and social perspectives for language technologies AS Doğruöz, S Sitaram, BE Bullock, AJ Toribio arXiv preprint arXiv:2301.01967, 2023 | 55 | 2023 |
Crowdsourcing speech data for low-resource languages from low-income workers B Abraham, D Goel, D Siddarth, K Bali, M Chopra, M Choudhury, P Joshi, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 41 | 2020 |
Curriculum design for code-switching: Experiments with language identification and language modeling with deep neural networks M Choudhury, K Bali, S Sitaram, A Baheti Proceedings of the 14th International Conference on Natural Language …, 2017 | 41 | 2017 |
Speech synthesis of code-mixed text S Sitaram, AW Black Proceedings of the Tenth International Conference on Language Resources and …, 2016 | 41 | 2016 |
Unsung challenges of building and deploying language technologies for low resource language communities P Joshi, C Barnes, S Santy, S Khanuja, S Shah, A Srinivasan, ... arXiv preprint arXiv:1912.03457, 2019 | 39 | 2019 |
A hindi speech recognizer for an agricultural video search application K Bali, S Sitaram, S Cuendet, I Medhi Proceedings of the 3rd ACM Symposium on Computing for Development, 1-8, 2013 | 38 | 2013 |
GCM: A toolkit for generating synthetic code-mixed text MSZ Rizvi, A Srinivasan, T Ganu, M Choudhury, S Sitaram Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 37 | 2021 |
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text. S Sitaram, SK Rallabandi, S Rijhwani, AW Black SSW, 76-81, 2016 | 37 | 2016 |
Phone merging for code-switched speech recognition S Sivasankaran, BML Srivastava, S Sitaram, K Bali, M Choudhury Third workshop on computational approaches to linguistic code-switching, 2018 | 35 | 2018 |
A new dataset for natural language inference from code-mixed conversations S Khanuja, S Dandapat, S Sitaram, M Choudhury arXiv preprint arXiv:2004.05051, 2020 | 33 | 2020 |
Two methods for assessing oral reading prosody M Duong, J Mostow, S Sitaram ACM Transactions on Speech and Language Processing (TSLP) 7 (4), 1-22, 2011 | 31 | 2011 |
Bootstrapping text-to-speech for speech processing in languages without an orthography S Sitaram, S Palkar, YN Chen, A Parlikar, AW Black 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 29 | 2013 |