Phonological similarity-based backoff smoothing to boost a bigram syllable boundary detection

S Suyanto - International Journal of Speech Technology, 2020 - Springer
Swapping one or more consonant-graphemes in a word into other phonologically similar
ones, which based on both place and manner of articulation, interestingly produces some …

Data augmentation methods for low-resource orthographic syllabification

S Suyanto, KM Lhaksmana, MA Bijaksana… - IEEE …, 2020 - ieeexplore.ieee.org
An n-gram syllabification model generally produces a high error rate for a low-resource
language, such as Indonesian, because of the high rate of out-of-vocabulary (OOV) n-grams …

Overcoming data sparsity in automatic transcription of dictated medical findings

E Pakoci, D Pekar, B Popović… - 2022 30th European …, 2022 - ieeexplore.ieee.org
This paper presents a method for introducing class n-gram language models as a means for
overcoming data sparsity in the training of an automatic speech recognition (ASR) system …

Quantitative analysis of the morphological complexity of Malayalam language

K Manohar, AR Jayan, R Rajan - … , Brno, Czech Republic, September 8–11 …, 2020 - Springer
This paper presents a quantitative analysis on the morphological complexity of Malayalam
language. Malayalam is a Dravidian language spoken in India, predominantly in the state of …

Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

AM Fanani, S Suyanto - Procedia Computer Science, 2021 - Elsevier
Syllabication or syllabification is an activity to detect syllable boundaries in a word. There
are two main ways for automatic syllabification, namely rule-based and data-driven. The rule …

Recurrent neural networks and morphological features in language modeling for Serbian

ET Pakoci, BZ Popović - 2021 29th Telecommunications Forum …, 2021 - ieeexplore.ieee.org
This paper describes the current state-of-the-art language model for the Serbian language,
and also a specific way of dealing with one of the issues that is present in Serbian automatic …

Towards more flexible human-machine speech communication

M Sečujski, N Jakovljević, N Simić… - 2023 31st …, 2023 - ieeexplore.ieee.org
The research presented in the paper addresses challenges related to the development of
more flexible systems for speech communication between humans and machines …

Methods for using class based n-gram language models in the Kaldi toolkit

E Pakoci, B Popović - International Conference on Speech and Computer, 2021 - Springer
This paper explains in detail several methods for utilization of class based n-gram language
models for automatic speech recognition, within the Kaldi speech recognition framework. It …

Transfer leaming in automatic speech recognition for Serbian

B Popović, E Pakoci, D Pekar - 2019 27th Telecommunications …, 2019 - ieeexplore.ieee.org
In automatic speech recognition systems the training data used for system development and
data expected to be obtained during the practical use of the system do not have to fit each …

[PDF][PDF] Automatic speech recognition system for dictating medical findings

B Popović, E Pakoci, D Pekar - Proc. of IcETRAN, 2020 - etran.rs
The paper presents an automatic speech recognition (ASR) system for dictating medical
findings, developed by AlfaNum–Speech Technologies Ltd for the Pension and Disability …