Takuya Yoshioka 个人学术档案

引用次数

	总计	2019 年至今
引用	10114	7309
h 指数	46	36
i10 指数	128	92

2100

1050

525

1575

2008200920102011201220132014201520162017201820192020202120222023202454 55 85 83 125 202 233 284 402 562 630 706 774 1141 1526 2061 1081

开放获取的出版物数量

查看全部

3 篇文章

2 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Tomohiro NakataniNTT Communication Science Laboratories在 ieee.org 的电子邮件经过验证
Keisuke KinoshitaResearch Scientist at Google在 ieee.org 的电子邮件经过验证
Marc DelcroixNTT Communication Science Laboratories在 ieee.org 的电子邮件经过验证
Shoko ArakiNTT Communication Science Laboratories在 ieee.org 的电子邮件经过验证
Masakiyo FujimotoSenior researcher, National Institute of Information and Communications Technology在 nict.go.jp 的电子邮件经过验证
Nobutaka ItoUniversity of Tokyo, Japan (formerly NTT)在 k.u-tokyo.ac.jp 的电子邮件经过验证
Armin SehrOTH Regensburg在 oth-regensburg.de 的电子邮件经过验证
Roland MaasSr. Science Manager at Amazon在 amazon.com 的电子邮件经过验证
Shinji WatanabeCarnegie Mellon University在 cmu.edu 的电子邮件经过验证
Takaaki HoriApple在 apple.com 的电子邮件经过验证
Hiroshi G OkunoProfessor Emeritus, Kyoto University, Adjunct Researcher, Waseda University在 nue.org 的电子邮件经过验证
Takuya HiguchiApple在 apple.com 的电子邮件经过验证
Atsushi NakamuraGraduate School of Natural Sciences, Nagoya City University在 ieee.org 的电子邮件经过验证
Yotaro KuboGoogle Speech在 ieee.org 的电子邮件经过验证
Chengzhu Yu （俞承柱）Amazon在 amazon.com 的电子邮件经过验证
Mehrez SoudenSr. Manager, Apple Inc.在 gatech.edu 的电子邮件经过验证
Hirokazu KameokaSenior Distinguished Researcher at NTT, Adjunct Associate Professor at NII在 hco.ntt.co.jp 的电子邮件经过验证
Mark GalesCambridge University在 eng.cam.ac.uk 的电子邮件经过验证

关注

Takuya Yoshioka

AssemblyAI

在 assemblyai.com 的电子邮件经过验证 - 首页

speech recognition speech enhancement speaker diarization machine learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Wavlm: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022	1078	2022
Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation Y Luo, Z Chen, T Yoshioka ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	716	2020
Speech dereverberation based on variance-normalized delayed linear prediction T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang IEEE Transactions on Audio, Speech, and Language Processing 18 (7), 1717-1731, 2010	483	2010
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, E Habets, ... 2013 IEEE Workshop on Applications of Signal Processing to Audio and …, 2013	457	2013
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research K Kinoshita, M Delcroix, S Gannot, EA P. Habets, R Haeb-Umbach, ... EURASIP Journal on Advances in Signal Processing 2016, 1-19, 2016	407	2016
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ... IEEE Signal Processing Magazine 29 (6), 114-126, 2012	333	2012
Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening T Yoshioka, T Nakatani IEEE Transactions on Audio, Speech, and Language Processing 20 (10), 2707-2720, 2012	301	2012
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020	297	2020
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	260	2015
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise T Higuchi, N Ito, T Yoshioka, T Nakatani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	254	2016
Continuous speech separation: Dataset and analysis Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	200	2020
Blind separation and dereverberation of speech mixtures by joint optimization T Yoshioka, T Nakatani, M Miyoshi, HG Okuno IEEE Transactions on Audio, Speech, and Language Processing 19 (1), 69-84, 2010	194	2010
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008	187	2008
Icassp 2023 deep noise suppression challenge H Dubey, A Aazami, V Gopal, B Naderi, S Braun, R Cutler, A Ju, ... IEEE Open Journal of Signal Processing, 2024	165	2024
End-to-end microphone permutation and number invariant multi-channel speech separation Y Luo, Z Chen, N Mesgarani, T Yoshioka ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	163	2020
Multi-channel overlapped speech recognition with location guided speech extraction network Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong 2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018	132	2018
Multi-microphone neural speech separation for far-field multi-talker speech recognition T Yoshioka, H Erdogan, Z Chen, F Alleva 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	130	2018
Continuous speech separation with conformer S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	129	2021
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ... Reverb workshop, 2014	126	2014
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017	124	2017

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用