Convolutional maxout neural networks for low-resource speech recognition M Cai, Y Shi, J Kang, J Liu, T Su The 9th International Symposium on Chinese Spoken Language Processing, 133-137, 2014 | 21 | 2014 |
Advanced recurrent network-based hybrid acoustic models for low resource speech recognition J Kang, WQ Zhang, WW Liu, J Liu, MT Johnson EURASIP Journal on Audio, Speech, and Music Processing 2018, 1-15, 2018 | 18 | 2018 |
Gated recurrent units based hybrid acoustic models for robust speech recognition J Kang, WQ Zhang, J Liu 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | 18 | 2016 |
High-performance Swahili keyword search with very limited language pack: The THUEE system for the OpenKWS15 evaluation M Cai, Z Lv, C Lu, J Kang, L Hui, Z Zhang, J Liu 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 14 | 2015 |
Leveraging phone mask training for phonetic-reduction-robust E2E Uyghur speech recognition G Ma, P Hu, J Kang, S Huang, H Huang arXiv preprint arXiv:2204.00819, 2022 | 11 | 2022 |
Gated convolutional networks based hybrid acoustic models for low resource speech recognition J Kang, WQ Zhang, J Liu 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 11 | 2017 |
GLaLT: Global-local attention-augmented light transformer for scene text recognition H Zhang, G Luo, J Kang, S Huang, X Wang, FY Wang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 8 | 2023 |
The TNT Team System Descriptions of Cantonese and Mongolian for IARPA OpenASR20 J Zhao, Z Lv, A Han, GB Wang, GX Shi, J Kang, J Yan, P Hu, S Huang, ... Interspeech, 4344-4348, 2021 | 8 | 2021 |
Neuron sparseness versus connection sparseness in deep neural network for large vocabulary speech recognition J Kang, C Lu, M Cai, WQ Zhang, J Liu 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 8 | 2015 |
Linguistic-acoustic similarity based accent shift for accent recognition Q Shao, J Yan, J Kang, P Guo, X Shi, P Hu, L Xie arXiv preprint arXiv:2204.03398, 2022 | 5 | 2022 |
Lattice based transcription loss for end-to-end speech recognition J Kang, WQ Zhang, WW Liu, J Liu, MT Johnson Journal of Signal Processing Systems 90, 1013-1023, 2018 | 5 | 2018 |
An LSTM-CTC based verification system for proxy-word based OOV keyword search Z Lv, J Kang, WQ Zhang, J Liu 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 4 | 2017 |
End-to-End Neural Speaker Diarization with Absolute Speaker Loss C Wang, J Li, X Fang, J Kang, Y Li Proc. INTERSPEECH, 3577-3581, 2023 | 3 | 2023 |
Improved system fusion for keyword search Z Lv, M Cai, C Lu, J Kang, L Hui, WQ Zhang, J Liu 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 3 | 2015 |
ICPR 2022 challenge on multi-modal subtitle recognition S Huang, S Huang, L Lu, P Hu, L Wang, X Wang, J Kang, W Liang, L Jin, ... 2022 26th International Conference on Pattern Recognition (ICPR), 4974-4980, 2022 | 1 | 2022 |
The TNT Team System Descriptions of Cantonese, Mongolian and Kazakh for IARPA OpenASR21 Challenge K Tang, J Zhao, J Yan, J Kang, H Wang, J Li, S Chai, GB Wang, S Huang, ... 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | | 2022 |
Multimedia Simultaneous Translation System for Minority Language Communication with Mandarin S Huang, B Hu, S Huang, P Hu, J Kang, Z Lv, J Yan, Q Ju, S Kang, D Tuo, ... INTERSPEECH, 4628-4629, 2019 | | 2019 |
The TNT Team System Descriptions for OpenASR21 K Tang, J Yan, J Kang, S Huang, P Hu, J Zhao, H Wang, J Li, S Chai, ... | | |
The TNT team system descriptions for IARPA OpenASR20 Z Lv, J Yan, P Hu, J Kang, J Zhao, G Shi, GB Wang, A Han, S Huang, ... | | |