DNN-based ultrasound-to-speech conversion for a silent speech interface TG Csapó, T Grósz, G Gosztolya, L Tóth, A Markó International Speech Communication Association (ISCA), 2017 | 72 | 2017 |
A comparative study on the contour tracking algorithms in ultrasound tongue images with automatic re-initialization K Xu, T Gábor Csapó, P Roussel, B Denby The Journal of the Acoustical Society of America 139 (5), EL154-EL160, 2016 | 48 | 2016 |
Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images K Xu, P Roussel, TG Csapó, B Denby The Journal of the Acoustical Society of America 141 (6), EL531-EL537, 2017 | 47 | 2017 |
Speech-centric multimodal interaction for easy-to-access online services–a personal life assistant for the elderly A Teixeira, A Hämäläinen, J Avelar, N Almeida, G Németh, T Fegyó, ... Procedia computer science 27, 389-397, 2014 | 47 | 2014 |
Error analysis of extracted tongue contours from 2D ultrasound images TG Csapó, SM Lulich Interspeech, 2157-2161, 2015 | 39 | 2015 |
Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces. L Tóth, G Gosztolya, T Grósz, A Markó, TG Csapó INTERSPEECH, 3172-3176, 2018 | 36 | 2018 |
F0 estimation for DNN-based ultrasound silent speech interfaces T Grósz, G Gosztolya, L Tóth, TG Csapó, A Markó 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 34 | 2018 |
DNN-based acoustic-to-articulatory inversion using ultrasound tongue imaging D Porras, A Sepúlveda-Sepúlveda, TG Csapó 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019 | 32 | 2019 |
Ultrasound-based articulatory-to-acoustic mapping with WaveGlow speech synthesis TG Csapó, C Zainkó, L Tóth, G Gosztolya, A Markó arXiv preprint arXiv:2008.03152, 2020 | 27 | 2020 |
Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder TG Csapó, G Németh, M Cernak, PN Garner 2016 24th European Signal Processing Conference (EUSIPCO), 1338-1342, 2016 | 27 | 2016 |
Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis. MS Al-Radhi, TG Csapó, G Németh Interspeech, 434-438, 2017 | 26 | 2017 |
Ultrasound-based silent speech interface using convolutional and recurrent neural networks EM Juanpere, TG Csapó Acta Acustica united with Acustica 105 (4), 587-590, 2019 | 24 | 2019 |
Increasing prosodic variability of text-to-speech synthesizers. G Németh, M Fék, TG Csapó INTERSPEECH, 474-477, 2007 | 24 | 2007 |
Ultrasound-based Silent Speech Interface Built on a Continuous Vocoder T Gábor Csapó, M Salah Al-Radhi, G Németh, G Gosztolya, T Grósz, ... arXiv e-prints, arXiv: 1906.09885, 2019 | 21* | 2019 |
Synthesizing expressive speech from amateur audiobook recordings É Székely, TG Csapó, B Tóth, P Mihajlik, J Carson-Berndsen 2012 IEEE Spoken Language Technology Workshop (SLT), 297-302, 2012 | 21 | 2012 |
Residual-based excitation with continuous F0 modeling in HMM-based speech synthesis TG Csapó, G Németh, M Cernak Statistical Language and Speech Processing: Third International Conference …, 2015 | 19 | 2015 |
Relation of formants and subglottal resonances in Hungarian vowels TG Csapó, Z Bárkányi, TE Gráczi, T Bőhm, SM Lulich Tenth Annual Conference of the International Speech Communication Association, 2009 | 19 | 2009 |
Autoencoder-based articulatory-to-acoustic mapping for ultrasound silent speech interfaces G Gosztolya, Á Pintér, L Tóth, T Grósz, A Markó, TG Csapó 2019 International Joint Conference on Neural Networks (IJCNN), 1-8, 2019 | 17 | 2019 |
Spemoticons: text to speech based emotional auditory cues G Németh, G Olaszy, TG Csapó International Community for Auditory Display, 2011 | 16 | 2011 |
Speaker dependent articulatory-to-acoustic mapping using real-time MRI of the vocal tract TG Csapó arXiv preprint arXiv:2008.00889, 2020 | 15 | 2020 |