LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild S Yang, Y Zhang, D Feng, M Yang, C Wang, J Xiao, K Long, S Shan, ... 2019 14th International Conference on Automatic Face and Gesture Recognition …, 2019 | 182 | 2019 |
Deformation Flow Based Two-Stream Network for Lip Reading J Xiao, S Yang, Y Zhang, S Shan, X Chen 2020 15th IEEE International Conference on Automatic Face and Gesture …, 2020 | 78 | 2020 |
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Y Zhang, S Yang, J Xiao, S Shan, X Chen 2020 15th IEEE International Conference on Automatic Face and Gesture …, 2020 | 77 | 2020 |
M3F: Multi-Modal Continuous Valence-Arousal Estimation in the Wild YH Zhang, R Huang, J Zeng, S Shan 2020 15th IEEE International Conference on Automatic Face and Gesture …, 2020 | 47 | 2020 |
UniCon: Unified Context Network for Robust Active Speaker Detection Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan, X Chen Proceedings of the 29th ACM International Conference on Multimedia, 3964-3972, 2021 | 33 | 2021 |
Multi-Task Learning for Audio-Visual Active Speaker Detection YH Zhang, J Xiao, S Yang, S Shan The ActivityNet Large-Scale Activity Recognition Challenge 2019, 2019 | 29 | 2019 |
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022 Y Zhang, S Liang, S Yang, S Shan The ActivityNet Large-Scale Activity Recognition Challenge 2022, 2022 | 3 | 2022 |
ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021 Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan The ActivityNet Large-Scale Activity Recognition Challenge 2021, 2021 | 3 | 2021 |
ES³: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations Y Zhang, S Yang, S Shan, X Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |