InternVideo: General video foundation models via generative and discriminative learning Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ... arXiv preprint arXiv:2212.03191, 2022 | 204 | 2022 |
VideoMAE V2: Scaling video masked autoencoders with dual masking L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 193 | 2023 |
InternVideo-Ego4D: A pack of champion solutions to Ego4D challenges G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... arXiv preprint arXiv:2211.09529, 2022 | 38 | 2022 |
MGMAE: Motion guided masking for video masked autoencoding B Huang, Z Zhao, G Zhang, Y Qiao, L Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
Asymmetric masked distillation for pre-training small foundation models Z Zhao, B Huang, S Xing, G Wu, Y Qiao, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |