Pengenalan Viseme Dinamis Bahasa Indonesia Menggunakan Convolutional Neural Network

A Nasuha, TA Sardjono… - Jurnal Nasional Teknik …, 2018 - journal.ugm.ac.id
There has been very little research on automatic lip reading in the Indonesian language,
especially the ones based on dynamic visemes. To improve the accuracy of a recognition …

A model of Indonesian dynamic visemes from facial motion capture database using a clustering-based approach

SS Arifin, M Muljono, M Hariadi - IAENG Int. J. Comput. Sci, 2017 - researchgate.net
Realistic 3D facial animation is a challenging task in the entertainment industries. One of the
efforts is to build a realistic lips animation. This research aims to build a model of Indonesian …

Automatic lip reading for daily Indonesian words based on frame difference and horizontal-vertical image projection

A Nasuha, F Arifin, TA Sardjono… - Journal of Theoretical …, 2017 - scholar.its.ac.id
Automatic lip reading is one of the research areas being developed lately. Automatic lip reading has
been used for various purposes, such as enhancing speech recognition and aid to speech …

Towards Automatic Mispronunciation Detection in Singing.

C Gupta, D Grunberg, P Rao, Y Wang - ISMIR, 2017 - ee.iitb.ac.in
ABSTRACT A tool for automatic pronunciation evaluation of singing is desirable for those
learning a second language. However, efforts to obtain pronunciation rules for such a tool …

Indonesian audio-visual speech corpus for multimodal automatic speech recognition

MRAR Maulana, MI Fanany - 2017 International Conference on …, 2017 - ieeexplore.ieee.org
Advancement of Automatic Speech Recognition (ASR) relies heavily on the availability of
the data, even more so for deep learning ASR system which is at the forefront of ASR …

Phoneme visem mapping for Marathi language using linguistic approach

A Brahme, U Bhadade - … on Global Trends in Signal Processing …, 2016 - ieeexplore.ieee.org
Visual speech is transcribed using visems. Visem is a set of visibly different phonemes in a
language. There is high correlation between Phoneme and visem and phoneme to visem …

Synthesis of Choir Songs Using MBROLA with Multiple Voices.

Y Suyanto - Engineering Letters, 2024 - search.ebscohost.com
Previous research has explored the synthesis of singing using the input of numbered
musical notation and song lyrics, with a primary focus on solo singers. This study takes a …

Rule-based pronunciation models to handle OOV words for Indonesian automatic speech recognition system

FY Putri, D Hoesen, DP Lestari - 2019 5th International …, 2019 - ieeexplore.ieee.org
A representative pronunciation dictionary becomes a necessity to cover large vocabulary in
many domains. While creating a hand designed pronunciation dictionary in an extensive …

Phoneme-Viseme Mapping for Sinhala Speaking Robot for Sri Lankan Healthcare Applications

W Wakkumbura, RAH Madhubhashana… - 2022 IEEE 4th …, 2022 - ieeexplore.ieee.org
Speech perception is considered entirely as an auditory process, but vision also has a
significant influence on speech perception. In generating synthesized vocal systems …

Establishment of Indonesian viseme sequences using hidden Markov model based on affection

E Setyati, O Susandono, L Zaman… - … Technology and Its …, 2017 - ieeexplore.ieee.org
Every language has different characteristics, one of which is how to pronounce the
language. Pronunciation accompanied by emotional expression are increasingly making …