Domain adaptation of DNN acoustic models using knowledge distillation T Asami, R Masumura, Y Yamaguchi, H Masataki, Y Aono 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 97 | 2017 |
A transformer-based audio captioning model with keyword estimation Y Koizumi, R Masumura, K Nishida, M Yasuda, S Saito arXiv preprint arXiv:2007.00222, 2020 | 69 | 2020 |
Soft-target training with ambiguous emotional utterances for DNN-based speech emotion classification A Ando, S Kobashikawa, H Kamiyama, R Masumura, Y Ijima, Y Aono 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 47 | 2018 |
Online end-of-turn detection from speech based on stacked time-asynchronous sequential networks. R Masumura, T Asami, H Masataki, R Ishii, R Higashinaka Interspeech 2017, 1661-1665, 2017 | 45 | 2017 |
Neural Dialogue Context Online End-of-Turn Detection R Masumura, T Tanaka, A Ando, R Ishii, R Higashinaka, Y Aono Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue …, 2018 | 36 | 2018 |
Hierarchical transformer-based large-context end-to-end ASR with large-context knowledge distillation R Masumura, N Makishima, M Ihori, A Takashima, T Tanaka, S Orihashi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Large context end-to-end automatic speech recognition via extension of hierarchical recurrent encoder-decoder models R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 33 | 2019 |
Neural Error Corrective Language Models for Automatic Speech Recognition. T Tanaka, R Masumura, H Masataki, Y Aono INTERSPEECH, 401-405, 2018 | 32 | 2018 |
Customer satisfaction estimation in contact center calls based on a hierarchical multi-task model A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono, T Toda IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 715-728, 2020 | 30 | 2020 |
Neural confnet classification: Fully neural network based spoken utterance classification using word confusion networks R Masumura, Y Ijima, T Asami, H Masataki, R Higashinaka 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 30 | 2018 |
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono INTERSPEECH, 1716-1720, 2017 | 28 | 2017 |
Improving neural text normalization with data augmentation at character- and morphological levels I Saito, J Suzuki, K Nishida, K Sadamitsu, S Kobashikawa, R Masumura, ... Proceedings of the Eighth International Joint Conference on Natural Language …, 2017 | 25 | 2017 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 24 | 2020 |
Adversarial training for multi-task and multi-lingual joint modeling of utterance intent classification R Masumura, Y Shinohara, R Higashinaka, Y Aono Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 | 22 | 2018 |
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 21 | 2020 |
End-to-end Japanese multi-dialect speech recognition and dialect identification with multi-task learning R Imaizumi, R Masumura, S Shiota, H Kiya APSIPA Transactions on Signal and Information Processing 11 (1), 2022 | 19 | 2022 |
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model. A Ando, R Masumura, H Kamiyama, S Kobashikawa, Y Aono INTERSPEECH, 2818-2822, 2019 | 19 | 2019 |
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono INTERSPEECH, 2210-2214, 2019 | 19 | 2019 |
Parallel phonetically aware DNNs and LSTM-RNNs for frame-by-frame discriminative modeling of spoken language identification R Masumura, T Asami, H Masataki, Y Aono Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 19 | 2017 |
Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition. R Masumura, S Hahm, A Ito INTERSPEECH, 1465-1468, 2011 | 17 | 2011 |