Emiru Tsunoo
Verified email at jp.sony.com
Title
Cited by
Year
Multi-accdoa: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training
K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 69 | Year: 2022
Transformer ASR with contextual block processing
E Tsunoo, Y Kashiwagi, T Kumakura, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 68 | Year: 2019
Beyond timbral statistics: Improving music classification using percussive patterns and bass lines
E Tsunoo, G Tzanetakis, N Ono, S Sagayama
IEEE Transactions on Audio, Speech, and Language Processing 19 (4), 1003-1014, 2010
Cited by: 49* | Year: 2010
Streaming transformer asr with blockwise synchronous beam search
E Tsunoo, Y Kashiwagi, S Watanabe
2021 IEEE Spoken Language Technology Workshop (SLT), 22-29, 2021
Cited by: 47 | Year: 2021
Harmonic and percussive sound separation and its application to MIR-related tasks
N Ono, K Miyamoto, H Kameoka, J Le Roux, Y Uchiyama, E Tsunoo, ...
Advances in music information retrieval, 213-236, 2010
Cited by: 43 | Year: 2010
Audio genre classification using percussive pattern clustering combined with timbral features
E Tsunoo, G Tzanetakis, N Ono, S Sagayama
2009 IEEE International Conference on Multimedia and Expo, 382-385, 2009
Cited by: 40 | Year: 2009
Information processing device, method of information processing, and program
Y Taki, S Kawano, T Shibuya, E Tsunoo
US Patent 10,546,582, 2020
Cited by: 37 | Year: 2020
Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation.
H Rump, S Miyabe, E Tsunoo, N Ono, S Sagayama
ISMIR, 87-92, 2010
Cited by: 35 | Year: 2010
Towards online end-to-end transformer automatic speech recognition
E Tsunoo, Y Kashiwagi, T Kumakura, S Watanabe
arXiv preprint arXiv:1910.11871, 2019
Cited by: 34 | Year: 2019
Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals
E Tsunoo, N Ono, S Sagayama
2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009
Cited by: 33 | Year: 2009
Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection
K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ...
arXiv preprint arXiv:2106.10806, 2021
Cited by: 25 | Year: 2021
Hierarchical Recurrent Neural Network for Story Segmentation.
E Tsunoo, P Bell, S Renals
INTERSPEECH, 2919-2923, 2017
Cited by: 25 | Year: 2017
Music mood classification by rhythm and bass-line unit pattern analysis
E Tsunoo, T Akase, N Ono, S Sagayama
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
Cited by: 24 | Year: 2010
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification.
E Tsunoo, N Ono, S Sagayama
ISMIR, 219-224, 2009
Cited by: 21 | Year: 2009
Making punctuation restoration robust and fast with multi-task learning and knowledge distillation
M Hentschel, E Tsunoo, T Okuda
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 14 | Year: 2021
Residual language model for end-to-end speech recognition
E Tsunoo, Y Kashiwagi, C Narisetty, S Watanabe
arXiv preprint arXiv:2206.07430, 2022
Cited by: 13 | Year: 2022
Spatial data augmentation with simulated room impulse responses for sound event localization and detection
Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 13 | Year: 2022
Streaming transformer ASR with blockwise synchronous inference
E Tsunoo, Y Kashiwagi, S Watanabe
arXiv preprint arXiv:2006.14941, 2020
Cited by: 11 | Year: 2020
Data augmentation methods for end-to-end speech recognition on distant-talk scenarios
E Tsunoo, K Shibata, C Narisetty, Y Kashiwagi, S Watanabe
arXiv preprint arXiv:2106.03419, 2021
Cited by: 10 | Year: 2021
Joint speech recognition and audio captioning
C Narisetty, E Tsunoo, X Chang, Y Kashiwagi, M Hentschel, S Watanabe
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 9 | Year: 2022
Articles 1–20