InternVideo: General video foundation models via generative and discriminative learning Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ... arXiv preprint arXiv:2212.03191, 2022 | 204 | 2022 |
VideoMAE V2: Scaling video masked autoencoders with dual masking L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 193 | 2023 |
InternVideo-Ego4D: A pack of champion solutions to Ego4D challenges G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... arXiv preprint arXiv:2211.09529, 2022 | 38 | 2022 |
MGMAE: Motion guided masking for video masked autoencoding B Huang, Z Zhao, G Zhang, Y Qiao, L Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
Multi-scale enhanced active learning for skeleton-based action recognition Y Zhang, Z Zhao, W Li, L Duan 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021 | 4 | 2021 |
Asymmetric Masked Distillation for Pre-Training Small Foundation Models Z Zhao, B Huang, S Xing, G Wu, Y Qiao, L Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 3 | 2024 |