Countermeasures for Automatic Speaker Verification Replay Spoofing Attack: On Data Augmentation, Feature Representation, Classification and Fusion. W Cai, D Cai, W Liu, G Li, M Li Interspeech, 17-21, 2017 | 86 | 2017 |
The DKU replay detection system for the ASVspoof 2019 challenge: On data augmentation, feature representation, classification, and fusion W Cai, H Wu, D Cai, M Li Interspeech, 1023-1027, 2019 | 69 | 2019 |
Utterance-level end-to-end language identification using attention-based CNN-BLSTM W Cai, D Cai, S Huang, M Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 67 | 2019 |
Within-sample variability-invariant loss for robust speaker recognition under noisy environments D Cai, W Cai, M Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 56 | 2020 |
Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation. X Qin, D Cai, M Li Interspeech, 4045-4049, 2019 | 47 | 2019 |
The DKU-DukeECE systems for VoxCeleb speaker recognition challenge 2020 W Wang, D Cai, X Qin, M Li arXiv preprint arXiv:2010.12731, 2020 | 45 | 2020 |
An iterative framework for self-supervised deep speaker representation learning D Cai, W Wang, M Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 39 | 2021 |
The DKU-DukeECE-Lenovo system for the diarization task of the 2021 VoxCeleb speaker recognition challenge W Wang, D Cai, Q Lin, L Yang, J Wang, J Wang, M Li arXiv preprint arXiv:2109.02002, 2021 | 31 | 2021 |
Cancellable speech template via random binary orthogonal matrices projection hashing KY Chee, Z Jin, D Cai, M Li, WS Yap, YL Lai, BM Goi Pattern Recognition 76, 273-287, 2018 | 27 | 2018 |
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum. D Cai, Z Ni, W Liu, W Cai, G Li, M Li Interspeech, 3452-3456, 2017 | 27 | 2017 |
Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment. D Cai, X Qin, M Li Interspeech, 4365-4369, 2019 | 25 | 2019 |
Incorporating visual information in audio based self-supervised speaker recognition D Cai, W Wang, M Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1422-1435, 2022 | 20 | 2022 |
Pretraining Conformer with ASR for speaker verification D Cai, W Wang, M Li, R Xia, C Huang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 17 | 2023 |
The DKU system for the speaker recognition task of the 2019 VOiCES from a distance challenge D Cai, X Qin, W Cai, M Li Interspeech, 2493-2497, 2019 | 16 | 2019 |
Similarity measurement of segment-level speaker embeddings in speaker diarization W Wang, Q Lin, D Cai, M Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2645-2658, 2022 | 15 | 2022 |
The DKU-DukeECE system for the self-supervision speaker verification task of the 2021 VoxCeleb speaker recognition challenge D Cai, M Li arXiv preprint arXiv:2109.02853, 2021 | 15 | 2021 |
End-to-end language identification using NetFV and NetVLAD J Chen, W Cai, D Cai, Z Cai, H Zhong, M Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018 | 15 | 2018 |
Identifying source speakers for voice conversion based spoofing attacks on speaker verification systems D Cai, Z Cai, M Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Robust multi-channel far-field speaker verification under different in-domain data availability scenarios X Qin, D Cai, M Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 71-85, 2022 | 12 | 2022 |
The DKU-JNU-EMA electromagnetic articulography database on Mandarin and Chinese dialects with tandem feature based acoustic-to-articulatory inversion Z Cai, X Qin, D Cai, M Li, X Liu, H Zhong 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018 | 12 | 2018 |