Masked autoencoders are scalable vision learners K He, X Chen, S Xie, Y Li, P Dollár, R Girshick Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 5790 | 2022 |
Adaptive batch normalization for practical domain adaptation Y Li, N Wang, J Shi, X Hou, J Liu Pattern Recognition 80, 109-117, 2018 | 1248* | 2018 |
Multiscale vision transformers H Fan, B Xiong, K Mangalam, Y Li, Z Yan, J Malik, C Feichtenhofer Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 1187 | 2021 |
Scale-aware trident networks for object detection Y Li, Y Chen, N Wang, Z Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 1038 | 2019 |
Co-occurrence feature learning for skeleton based action recognition using regularized deep LSTM networks W Zhu, C Lan, J Xing, W Zeng, Y Li, L Shen, X Xie Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016 | 965 | 2016 |
Demystifying neural style transfer Y Li, N Wang, J Liu, X Hou International Joint Conference on Artificial Intelligence (IJCAI), 2017 | 715 | 2017 |
Ego4D: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 649 | 2022 |
Exploring plain vision transformer backbones for object detection Y Li, H Mao, R Girshick, K He European conference on computer vision, 280-296, 2022 | 619 | 2022 |
MViTv2: Improved multiscale vision transformers for classification and detection Y Li, CY Wu, H Fan, K Mangalam, B Xiong, J Malik, C Feichtenhofer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 594 | 2022 |
Masked autoencoders as spatiotemporal learners C Feichtenhofer, Y Li, K He Advances in neural information processing systems 35, 35946-35958, 2022 | 383 | 2022 |
PKU-MMD: A large scale benchmark for skeleton-based human action understanding C Liu, Y Hu, Y Li, S Song, J Liu Proceedings of the workshop on visual analysis in smart and connected …, 2017 | 323* | 2017 |
Online human action detection using joint classification-regression recurrent neural networks Y Li, C Lan, J Xing, W Zeng, C Yuan, J Liu Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 230 | 2016 |
Factorized bilinear models for image recognition Y Li, N Wang, J Liu, X Hou International Conference on Computer Vision (ICCV), 2017 | 215 | 2017 |
Scaling language-image pre-training via masking Y Li, H Fan, R Hu, C Feichtenhofer, K He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 201 | 2023 |
MeMViT: Memory-augmented multiscale vision transformer for efficient long-term video recognition CY Wu, Y Li, K Mangalam, H Fan, B Xiong, J Malik, C Feichtenhofer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 167 | 2022 |
Benchmarking detection transfer learning with vision transformers Y Li, S Xie, X Chen, P Dollár, K He, R Girshick arXiv preprint arXiv:2111.11429, 2021 | 148 | 2021 |
Ego-Topo: Environment affordances from egocentric video T Nagarajan, Y Li, C Feichtenhofer, K Grauman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 118 | 2020 |
PySlowFast H Fan, Y Li, B Xiong, WY Lo, C Feichtenhofer | 100 | 2020 |
Deep joint discriminative learning for vehicle re-identification and retrieval Y Li, Y Li, H Yan, J Liu 2017 IEEE International Conference on Image Processing (ICIP), 2017 | 72 | 2017 |
Multi-modality multi-task recurrent neural network for online action detection J Liu, Y Li, S Song, J Xing, C Lan, W Zeng IEEE Transactions on Circuits and Systems for Video Technology 29 (9), 2667-2682, 2018 | 67 | 2018 |