Guangzhi Sun 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	535	534
h 指数	10	10
i10 指数	11	11

160

120

2019202020212022202320242 34 108 99 139 147

开放获取的出版物数量

查看全部

3 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

关注

Guangzhi Sun

University of Cambridge

在 cam.ac.uk 的电子邮件经过验证 - 首页

Speech and language technology conversational AI


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis G Sun, Y Zhang, RJ Weiss, Y Cao, H Zen, Y Wu ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	141	2020
Generating diverse and natural text-to-speech samples using a quantized fine-grained vae and autoregressive prosody prior G Sun, Y Zhang, RJ Weiss, Y Cao, H Zen, A Rosenberg, B Ramabhadran, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	114*	2020
Salmonn: Towards generic hearing abilities for large language models C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.13289, 2023	60	2023
Speaker diarisation using 2D self-attentive combination of embeddings G Sun, C Zhang, PC Woodland ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	39	2019
Transformer language models with LSTM-based cross-utterance information representation G Sun, C Zhang, PC Woodland ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	36	2021
Tree-constrained pointer generator for end-to-end contextual speech recognition G Sun, C Zhang, PC Woodland 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	25	2021
Connecting speech encoder and large language model for asr W Yu, C Tang, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	17	2024
Combination of deep speaker embeddings for diarisation G Sun, C Zhang, PC Woodland Neural Networks 141, 372-384, 2021	17	2021
Can contextual biasing remain effective with Whisper and GPT-2? G Sun, X Zheng, C Zhang, PC Woodland arXiv preprint arXiv:2306.01942, 2023	11	2023
Minimising biasing word errors for contextual ASR with the tree-constrained pointer generator G Sun, C Zhang, PC Woodland IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 345-354, 2022	11	2022
Tree-constrained pointer generator with graph neural network encodings for contextual speech recognition G Sun, C Zhang, PC Woodland arXiv preprint arXiv:2207.00857, 2022	10	2022
Fine-grained audio-visual joint representations for multimodal large language models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.05863, 2023	7	2023
End-to-end spoken language understanding with tree-constrained pointer generator G Sun, C Zhang, PC Woodland ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	7	2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-9, 2023	6	2023
Knowledge-aware audio-grounded generative slot filling for limited annotated data G Sun, C Zhang, I Vulić, P Budzianowski, PC Woodland arXiv preprint arXiv:2307.01764, 2023	6	2023
Cross-utterance conditioned VAE for non-autoregressive text-to-speech Y Li, C Yu, G Sun, H Jiang, F Sun, W Zu, Y Wen, Y Yang, J Wang arXiv preprint arXiv:2205.04120, 2022	6	2022
Cross-utterance language models with acoustic error sampling G Sun, C Zhang, PC Woodland arXiv preprint arXiv:2009.01008, 2020	5	2020
Content-aware speaker embeddings for speaker diarisation G Sun, D Liu, C Zhang, PC Woodland ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	3	2021
Graph neural networks for contextual ASR with the tree-constrained pointer generator G Sun, C Zhang, PC Woodland IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	2	2024
Extending large language models for speech and audio captioning C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用