Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 294 | 2024 |
Building compact local pairwise codebook with joint feature space clustering N Morioka, S Satoh Computer Vision–ECCV 2010: 11th European Conference on Computer Vision …, 2010 | 91 | 2010 |
Robust visual reranking via sparsity and ranking constraints N Morioka, J Wang Proceedings of the 19th ACM international conference on Multimedia, 533-542, 2011 | 35 | 2011 |
Traffic signals control system N Morioka, EE Huang, B Hengst US Patent 8,212,688, 2012 | 33 | 2012 |
Reranking using confident image samples J Wang, S Li, N Morioka US Patent 9,384,241, 2016 | 30 | 2016 |
LibriTTS-R: A restored multi-speaker text-to-speech corpus Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, M Bacchiani, ... arXiv preprint arXiv:2305.18802, 2023 | 29 | 2023 |
Learning Directional Local Pairwise Bases with Sparse Coding N Morioka, S Satoh BMVC 1617, 1621, 2010 | 25 | 2010 |
Compact correlation coding for visual object categorization N Morioka, S Satoh 2011 International conference on computer vision, 1639-1646, 2011 | 22 | 2011 |
Multidimensional shape constraints M Gupta, E Louidor, O Mangylov, N Morioka, T Narayan, S Zhao International Conference on Machine Learning, 3918-3928, 2020 | 20 | 2020 |
Generalized lasso based approximation of sparse coding for visual recognition N Morioka, S Satoh Advances in Neural Information Processing Systems 24, 2011 | 19 | 2011 |
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation Y Jia, Y Ding, A Bapna, C Cherry, Y Zhang, A Conneau, N Morioka arXiv preprint arXiv:2203.13339, 2022 | 18 | 2022 |
Miipher: A robust speech restoration model integrating self-supervised speech and text representations Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, Y Zhang, W Han, ... 2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023 | 13 | 2023 |
Virtuoso: Massive multilingual speech-text joint semi-supervised learning for text-to-speech T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
E3 TTS: Easy end-to-end diffusion-based text to speech Y Gao, N Morioka, Y Zhang, N Chen 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 10 | 2023 |
Residual adapters for few-shot text-to-speech speaker adaptation N Morioka, H Zen, N Chen, Y Zhang, Y Ding arXiv preprint arXiv:2210.15868, 2022 | 10 | 2022 |
Learning object representations using sequential patterns N Morioka Australasian Joint Conference on Artificial Intelligence, 551-561, 2008 | 8 | 2008 |
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Residual adapters for few-shot text-to-speech speaker adaptation N Morioka, H Zen, N Chen, Y Zhang, Y Ding arXiv preprint arXiv:2210.15868, 2022 | 5 | 2022 |
Monotonic Kronecker-factored lattice WT Bakst, N Morioka, E Louidor International Conference on Learning Representations, 2021 | 5 | 2021 |
LibriTTS-R: Restoration of a large-scale multi-speaker TTS corpus Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, MAU Bacchiani, ... | 1 | 2023 |