IndicVoices-R: Unlocking a massive multilingual multi-speaker speech corpus for scaling Indian TTS

A Sankar, S Anand, PS Varadhan, S Thomas… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in text-to-speech (TTS) synthesis show that large-scale models
trained with extensive web data produce highly natural-sounding output. However, such …

Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies

S Anand, PS Varadhan, A Sankar, G Raju… - arXiv preprint arXiv …, 2024 - arxiv.org
Publicly available TTS datasets for low-resource languages like Hindi and Tamil typically
contain 10-20 hours of data, leading to poor vocabulary coverage. This limitation becomes …

AI-Powered Real-Time Speech-to-Speech Translation for Virtual Meetings Using Machine Learning Models

S Karunya, M Jalakandeshwaran… - … and Control for …, 2023 - ieeexplore.ieee.org
In our interconnected world, language diversity poses communication challenges,
particularly in virtual meetings. Our solution, a Real-Time Speech-to-Speech Translation …

Everyday Speech in the Indian Subcontinent

U Pathak, CSK Gunda, S Sathiyamoorthy… - arXiv preprint arXiv …, 2024 - arxiv.org
India has 1369 languages of which 22 are official. About 13 different scripts are used to
represent these languages. A Common Label Set (CLS) was developed based on phonetics …

ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams

S Anand, PS Varadhan, M Singal… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in Text-to-Speech (TTS) technology have led to natural-sounding
speech for English, primarily due to the availability of large-scale, high-quality web data …

Lightweight, Multi-speaker, Multi-lingual Indic Text-To-Speech

A Singh, A Nagireddi, A Jayakumar… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The Lightweight, Multi-speaker, Multi-lingual Indic Text-to-Speech (LIMMITS'23) challenge is
organized as part of the ICASSP 2023 Signal Processing Grand Challenge. LIMMITS'23 …

MunTTS: A Text-to-Speech System for Mundari

V Gumma, R Hada, A Yadavalli, P Gogoi… - arXiv preprint arXiv …, 2024 - arxiv.org
We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a
low-resource Indian language of the Austro-Asiatic family. Our work addresses the gap in …

IndicMOS: Multilingual MOS Prediction for 7 Indian languages

S Udupa, S Maiti, PK Ghosh - Proc. Interspeech 2024, 2024 - isca-archive.org
Subjective evaluation is the gold standard for the evaluation of speech in different tasks such
as text-to-speech (TTS), and voice-cloning (VC). However, these evaluations can be costly …

Exploring Solutions for Text-to-Speech Synthesis of Low-Resource Languages

AR Gladston, KV Pradeep - 2023 4th International Conference …, 2023 - ieeexplore.ieee.org
A text-to-speech synthesis system is expected to convert any given text to highly intelligible
and natural speech that sounds as human-like as possible. It finds its place in a variety of …

Recent Trends in Text to Speech Synthesis in Context with Indian Languages

M Gupta, A Dev, P Bansal - International Conference on Artificial …, 2023 - Springer
The most important type of communication in daily life is speech. However, for those who are
physically or visually challenged or illiterate, using computers is confusing due to the …