Pengenalan Viseme Dinamis Bahasa Indonesia Menggunakan Convolutional Neural Network
A Nasuha, TA Sardjono… - Jurnal Nasional Teknik …, 2018 - journal.ugm.ac.id
There has been very little researches on automatic lip reading in Indonesian language,
especially the ones based on dynamic visemes. To improve the accuracy of a recognition …
especially the ones based on dynamic visemes. To improve the accuracy of a recognition …
[PDF][PDF] A model of Indonesian dynamic visemes from facial motion capture database using a clustering-based approach
Realistic 3D facial animation is a challenging task in the entertainment industries. One of the
efforts is to build a realistic lips animation. This research aims to build a model of Indonesian …
efforts is to build a realistic lips animation. This research aims to build a model of Indonesian …
Automatic lip reading for daily Indonesian words based on frame difference and horizontal-vertical image projection
Automatic lip reading is one of research being developed lately. Automatic lip reading has
been used for various purposes, such as enhancing speech recognition and aid to speech …
been used for various purposes, such as enhancing speech recognition and aid to speech …
[PDF][PDF] Towards Automatic Mispronunciation Detection in Singing.
ABSTRACT A tool for automatic pronunciation evaluation of singing is desirable for those
learning a second language. However, efforts to obtain pronunciation rules for such a tool …
learning a second language. However, efforts to obtain pronunciation rules for such a tool …
Indonesian audio-visual speech corpus for multimodal automatic speech recognition
MRAR Maulana, MI Fanany - 2017 International Conference on …, 2017 - ieeexplore.ieee.org
Advancement of Automatic Speech Recognition (ASR) relies heavily on the availability of
the data, even more so for deep learning ASR system which is at the forefront of ASR …
the data, even more so for deep learning ASR system which is at the forefront of ASR …
Phoneme visem mapping for Marathi language using linguistic approach
Visual speech is transcribed using visems. Visem is a set of visibly different phonemes in a
language. There is high correlation between Phoneme and visem and phoneme to visem …
language. There is high correlation between Phoneme and visem and phoneme to visem …
Synthesis of Choir Songs Using MBROLA with Multiple Voices.
Y Suyanto - Engineering Letters, 2024 - search.ebscohost.com
Previous research has explored the synthesis of singing using the input of numbered
musical notation and song lyrics, with a primarily focus on solo singers. This study takes a …
musical notation and song lyrics, with a primarily focus on solo singers. This study takes a …
Rule-based pronunciation models to handle oov words for indonesian automatic speech recognition system
FY Putri, D Hoesen, DP Lestari - 2019 5th International …, 2019 - ieeexplore.ieee.org
A representative pronunciation dictionary becomes a necessity to cover large vocabulary in
many domains. While creating a hand designed pronunciation dictionary in an extensive …
many domains. While creating a hand designed pronunciation dictionary in an extensive …
Phoneme-Viseme Mapping for Sinhala Speaking Robot for Sri Lankan Healthcare Applications
W Wakkumbura, RAH Madhubhashana… - 2022 IEEE 4th …, 2022 - ieeexplore.ieee.org
Speech perception is considered entirely as an auditory process, but vision also has a
significant influence on speech perception. In generating synthesized vocal systems …
significant influence on speech perception. In generating synthesized vocal systems …
Establishment of Indonesian viseme sequences using hidden markov model based on affection
Every language has different characteristics, one of which is how to pronounce the
language. Pronunciation accompanied by emotional expression are increasingly making …
language. Pronunciation accompanied by emotional expression are increasingly making …