[PDF][PDF] Development of Text and Speech database for Hindi and Indian English specific to Mobile Communication environment.
SS Agrawal, S Sinha, P Singh, JØ Olsen - LREC, 2012 - Citeseer
This paper describes the method and experiences of text and speech data collection in
mobile communication in Indian English Hindi. The primary data collection is done in the …
mobile communication in Indian English Hindi. The primary data collection is done in the …
Collaborative speech data acquisition for under resourced languages through crowdsourcing
Scarcity of resources in under resourced languages may leave these languages behind in
race of development of data driven NLP systems. Crowdsourcing has come up as a …
race of development of data driven NLP systems. Crowdsourcing has come up as a …
Development of Hindi mobile communication text and speech corpus
S Sinha, SS Agrawal, J Olsen - 2011 International Conference …, 2011 - ieeexplore.ieee.org
This paper describes the collection of a text and audio corpus for mobile personal
communication in Hindi. Hindi is the largest of the Indian languages, and is the first …
communication in Hindi. Hindi is the largest of the Indian languages, and is the first …
Development of Text and Speech Corpus for Designing the Multilingual Recognition System
S Bansal, SS Agrawal - 2018 Oriental COCOSDA-International …, 2018 - ieeexplore.ieee.org
To create the multilingual speech and text corpus manually is very difficult and time-
consuming task. This paper presents the overall methodology and experiences of text and …
consuming task. This paper presents the overall methodology and experiences of text and …
Corpus design and development of an annotated speech database for Punjabi
Punjabi is an important Indo-Aryan languages spoken in India and in some other countries
especially Pakistan. It is a tonal language and its phonetic and phonological aspects have …
especially Pakistan. It is a tonal language and its phonetic and phonological aspects have …
[PDF][PDF] SAMPA for Hindi and Punjabi based on their Acoustic and Phonetic Characteristics [C]
KK Arora, S Arora, SR Singla… - Proc. International Oriental …, 2007 - academia.edu
Abstract SAMPA (Speech Assessment Methods Phonetic Alphabet) is a machine readable
phonetic alphabet and hence facilitates easy processing of data for many applications in …
phonetic alphabet and hence facilitates easy processing of data for many applications in …
[PDF][PDF] Multilingual Crowdsourcing Methodology for Developing Resources for Under-resourced Indian Languages
Huge Data collection challenge gets intensified for Under Resources Languages especially
for large variety of Indian languages. We propose building of a common framework for …
for large variety of Indian languages. We propose building of a common framework for …
[PDF][PDF] A review on selection and correction of text and speech data for Indian languages
H Bharad, T Kodinariya - ETCEE–2015, 2015 - academia.edu
Today the word is going towards hands-free interfacing with machine using speech
commands and/or speech recognition. The computer understands binary or machine level …
commands and/or speech recognition. The computer understands binary or machine level …