Conditional sound generation using neural discrete time-frequency representation learning X Liu, T Iqbal, J Zhao, Q Huang, MD Plumbley, W Wang 2021 IEEE 31st International Workshop on Machine Learning for Signal …, 2021 | 53 | 2021 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... arXiv preprint arXiv:2108.02752, 2021 | 46 | 2021 |
Separate what you describe: Language-queried audio source separation X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang arXiv preprint arXiv:2203.15147, 2022 | 33 | 2022 |
Leveraging pre-trained bert for audio captioning X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kilic, ... 2022 30th European Signal Processing Conference (EUSIPCO), 1145-1149, 2022 | 28 | 2022 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6 X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021 | 15 | 2021 |
Robust real-time object detection based on deep learning for very high resolution remote sensing images Y Zhao, J Zhao, C Zhao, W Xiong, Q Li, J Yang IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium …, 2019 | 14 | 2019 |
Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting X Li, L Wang, G Dong, K He, J Zhao, H Lei, J Liu, W Xu Findings of the Association for Computational Linguistics: ACL 2023., 2023 | 12 | 2023 |
PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling G Dong, D Guo, L Wang, X Li, Z Wang, C Zeng, K He, J Zhao, H Lei, X Cui, ... Proceedings of the 29th International Conference on Computational Linguistics, 2022 | 11 | 2022 |
Deep neural decision forest for acoustic scene classification J Sun, X Liu, X Mei, J Zhao, MD Plumbley, V Kılıç, W Wang 2022 30th European Signal Processing Conference (EUSIPCO), 772-776, 2022 | 9 | 2022 |
Fish feeding intensity assessment in aquaculture: A new audio dataset AFFIA3K and a deep learning algorithm M Cui, X Liu, J Zhao, J Sun, G Lian, T Chen, MD Plumbley, D Li, W Wang 2022 IEEE 32nd International Workshop on Machine Learning for Signal …, 2022 | 8 | 2022 |
A robust contrastive alignment method for multi-domain text classification X Li, H Lei, L Wang, G Dong, J Zhao, J Liu, W Xu, C Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Audio-visual tracking of multiple speakers via a pmbm filter J Zhao, P Wu, X Liu, Y Xu, L Mihaylova, S Godsill, W Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
Towards robust and generalizable training: An empirical study of noisy slot filling for input perturbations J Liu, L Wang, G Dong, X Song, Z Wang, Z Wang, S Lei, J Zhao, K He, ... arXiv preprint arXiv:2310.03518, 2023 | 4 | 2023 |
Partial arithmetic Consensus based distributed intensity particle flow SMC-PHD filter for multi-target tracking P Wu, J Zhao, S Goudarzi, W Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Visually assisted self-supervised audio speaker localization and tracking J Zhao, P Wu, S Goudarzi, X Liu, J Sun, Y Xu, W Wang 2022 30th European Signal Processing Conference (EUSIPCO), 787-791, 2022 | 3 | 2022 |
Audio-visual speaker tracking: Progress, challenges, and future directions J Zhao, Y Xu, X Qian, D Berghi, P Wu, M Cui, J Sun, PJB Jackson, ... arXiv preprint arXiv:2310.14778, 2023 | 2 | 2023 |
Audio Visual Speaker Localization from EgoCentric Views J Zhao, Y Xu, X Qian, W Wang arXiv preprint arXiv:2309.16308, 2023 | 2 | 2023 |
Advanced machine learning methods for autonomous classification of ground vehicles with acoustic data X Liu, Q Li, J Liang, J Zhao, P Wu, C Lyu, S Goudarzi, J George, T Pham, ... Artificial Intelligence and Machine Learning for Multi-Domain Operations …, 2022 | 2 | 2022 |
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. J Zhao, P Wu, X Liu, S Goudarzi, H Liu, Y Xu, W Wang INTERSPEECH, 3704-3708, 2022 | 2 | 2022 |
Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking J Zhao, Y Xu, X Qian, H Liu, MD Plumbley, W Wang IEEE Open Journal of Signal Processing, 2024 | 1 | 2024 |