Detection of chatter vibration in end milling applying disturbance observer Y Kakinuma, Y Sudo, T Aoyama CIRP annals 60 (1), 109-112, 2011 | 107 | 2011 |
DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models Y Peng, Y Sudo, S Muhammad, S Watanabe Proc. Interspeech 2023, 2023 | 26 | 2023 |
Reproducing whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 22 | 2023 |
Sound Event Aware Environmental Sound Segmentation with Mask U-Net Y Sudo, K Itoyama, K Nishida, K Nakadai Advanced Robotics 34 (20), pp. 1280-1290, 2020 | 17 | 2020 |
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders Y Sudo, M Shakeel, B Yan, J Shi, S Watanabe Proc. Interspeech 2023, 2023 | 15 | 2023 |
Multichannel environmental sound segmentation with separately trained spectral and spatial features Y Sudo, K Itoyama, K Nishida, K Nakadai Applied Intelligence 51 (11), 8245-8259, 2021 | 15* | 2021 |
Environmental Sound Segmentation utilizing Mask U-Net Y Sudo, K Itoyama, K Nishida, K Nakadai IEEE/RSJ International Conference on Intelligent Robots and Systems, pp …, 2019 | 14 | 2019 |
OWSM v3. 1: Better and faster open whisper-style speech models based on e-branchformer Y Peng, J Tian, W Chen, S Arora, B Yan, Y Sudo, M Shakeel, K Choi, ... arXiv preprint arXiv:2401.16658, 2024 | 13 | 2024 |
エンドミル加工における外乱オブザーバを用いたセンサレスびびり振動検出技術の開発(第1報): 平均計時法を用いた高精度プロセスモニタリング 周藤唯, 柿沼康弘, 大西公平, 青山藤詞郎 精密工学会誌 77 (7), 707-712, 2011 | 12* | 2011 |
Improvement of DOA estimation by using quaternion output in sound event localization and detection Y Sudo, K Itoyama, K Nishida, K Nakadai Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 244-247, 2019 | 11 | 2019 |
Multi-Channel Environmental Sound Segmentation Utilizing Sound Source Localization and Separation U-Net Y Sudo, K Itoyama, K Nishida, K Nakadai IEEE/SICE International Symposium on System Integration, pp. 382-387, 2021 | 9 | 2021 |
Abnormal sound detection apparatus and detection method Y Sudo US Patent 10,607,632, 2020 | 8 | 2020 |
Time-synchronous one-pass Beam Search for Parallel Online and Offline Transducers with Dynamic Block Training Y Sudo, M Shakeel, Y Peng, S Watanabe Proc. Interspeech 2023, 2023 | 6 | 2023 |
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification Y Peng, Y Sudo, M Shakeel, S Watanabe arXiv preprint arXiv:2402.12654, 2024 | 5 | 2024 |
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search Y Sudo, M Shakeel, Y Fukumoto, Y Peng, S Watanabe ICASSP2024, 2024 | 3 | 2024 |
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model R Takeda, Y Sudo, K Nakadai, K Komatani Proc. Interspeech 2022, 3789-3793, 2022 | 3 | 2022 |
Abnormal sound determination apparatus and determination method Y Sudo US Patent 10,475,469, 2019 | 3 | 2019 |
Development of chatter vibration detection utilizing disturbance observer (1st report) precise sensor-less process monitoring utilizing average T method Y Sudo, Y Kakinuma, K Ohnishi, T Aoyama Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision …, 2011 | 3 | 2011 |
Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation Y Sudo, M Takigahira, H Tsuru, K Nakadai, H Nakajima | 2 | 2023 |
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation Y Sudo, K Hata, K Nakadai Proc. Interspeech 2023, 2023 | 2 | 2023 |