DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models Y Peng, Y Sudo, S Muhammad, S Watanabe Proceedings of the Annual Conference of the International Speech …, 2023 | 26 | 2023 |
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 22 | 2023 |
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders Y Sudo, M Shakeel, B Yan, J Shi, S Watanabe https://www.isca-archive.org/interspeech_2023/sudo23_interspeech.pdf, 2023 | 15 | 2023 |
Detecting earthquakes: a novel deep learning-based approach for effective disaster response M Shakeel, K Itoyama, K Nishida, K Nakadai Applied Intelligence 51 (11), 8305-8315, 2021 | 14 | 2021 |
OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Y Peng, J Tian, W Chen, S Arora, B Yan, Y Sudo, M Shakeel, K Choi, ... arXiv preprint arXiv:2401.16658, 2024 | 13 | 2024 |
EMC: Earthquake Magnitudes Classification on Seismic Signals via Convolutional Recurrent Networks M Shakeel, K Itoyama, K Nishida, K Nakadai 2021 IEEE/SICE International Symposium on System Integration (SII), 388-393, 2021 | 8 | 2021 |
Time-synchronous one-pass beam search for parallel online and offline transducers with dynamic block training Y Sudo, M Shakeel, Y Peng, S Watanabe Proceedings of the Annual Conference of the International Speech …, 2023 | 6 | 2023 |
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification Y Peng, Y Sudo, M Shakeel, S Watanabe arXiv preprint arXiv:2402.12654, 2024 | 5 | 2024 |
3D Convolution Recurrent Neural Networks for Multi-Label Earthquake Magnitude Classification M Shakeel, K Itoyama, K Nishida, K Nakadai Applied Sciences 12 (4), 2195, 2022 | 5 | 2022 |
Contextualized Automatic Speech Recognition With Attention-Based Bias Phrase Boosted Beam Search Y Sudo, M Shakeel, Y Fukumoto, Y Peng, S Watanabe ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Environmental sensing using millimeter wave sensor for extreme conditions M Shakeel, D Nardi, K Ohno, S Tadokoro 2015 IEEE International Symposium on Safety, Security, and Rescue Robotics …, 2015 | 3 | 2015 |
Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation Z Zhong, M Shakeel, K Itoyama, K Nishida, K Nakadai 2021 IEEE/SICE International Symposium on System Integration (SII), 369-374, 2021 | 2 | 2021 |
Metric-based multimodal meta-learning for human movement identification via footstep recognition M Shakeel, K Itoyama, K Nishida, K Nakadai 2023 IEEE/SICE International Symposium on System Integration (SII), 1-8, 2023 | 1 | 2023 |
FPGA based Power-Efficient Edge Server to Accelerate Speech Interface for Socially Assistive Robotics H Gulzar, M Shakeel, K Itoyama, K Nakadai, K Nishida, H Amano, T Eda 2023 IEEE/SICE International Symposium on System Integration (SII), 1-6, 2023 | 1 | 2023 |
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection Y Sudo, M Shakeel, K Nakadai, J Shi, S Watanabe Proc. Interspeech 2022, 4641-4645, 2022 | 1 | 2022 |
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss M Shakeel, Y Sudo, Y Peng, S Watanabe arXiv preprint arXiv:2406.16120, 2024 | | 2024 |
4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders Y Sudo, M Shakeel, Y Fukumoto, B Yan, J Shi, Y Peng, S Watanabe arXiv preprint arXiv:2406.02950, 2024 | | 2024 |
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Y Sudo, Y Fukumoto, M Shakeel, Y Peng, S Watanabe arXiv preprint arXiv:2405.13344, 2024 | | 2024 |
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation M Shakeel, Y Sudo, Y Peng, S Watanabe ICASSP 2024 Workshop on Hands-free Speech Communication and Microphone …, 2024 | | 2024 |
End-to-end integration of online and offline encoders using auxiliary losses for automatic speech recognition S Muhammad, S Yui, P Yifan, W Shinji 人工知能学会第二種研究会資料 2023 (Challenge-063), 03, 2023 | | 2023 |