SimMIM: A simple framework for masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 1077 | 2022 |
Trajectory-based modeling of human actions with motion reference points YG Jiang, Q Dai, X Xue, W Liu, CW Ngo Computer Vision–ECCV 2012: 12th European Conference on Computer Vision …, 2012 | 295 | 2012 |
Self-supervised learning with swin transformers Z Xie, Y Lin, Z Yao, Z Zhang, Q Dai, Y Cao, H Hu arXiv preprint arXiv:2105.04553, 2021 | 179 | 2021 |
Weakly-supervised action localization by generative attention modeling B Shi, Q Dai, Y Mu, J Wang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 176 | 2020 |
On the connection between local attention and dynamic depth-wise convolution Q Han, Z Fan, Q Dai, L Sun, MM Cheng, J Liu, J Wang International Conference on Learning Representations, 2022 | 145* | 2022 |
Learning spatial awareness to improve crowd counting ZQ Cheng, JX Li, Q Dai, X Wu, AG Hauptmann Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 144 | 2019 |
Recurrent tubelet proposal and recognition networks for action detection D Li, Z Qiu, Q Dai, T Yao, T Mei Proceedings of the European conference on computer vision (ECCV), 303-318, 2018 | 134 | 2018 |
Deep incremental hashing network for efficient image retrieval D Wu, Q Dai, J Liu, B Li, W Wang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 120 | 2019 |
Informative Dropout for Robust Representation Learning: A Shape-bias Perspective B Shi, D Zhang, Q Dai, Z Zhu, Y Mu, J Wang Proceedings of the 37th International Conference on Machine Learning, 8828-8839, 2020 | 109 | 2020 |
Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning. Q Dai, RW Zhao, Z Wu, X Wang, Z Gu, W Wu, YG Jiang MediaEval 1436, 2015 | 95 | 2015 |
Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting ZQ Cheng, JX Li, Q Dai, X Wu, JY He, A Hauptmann Proceedings of the 27th ACM International Conference on Multimedia, 1897-1906, 2019 | 87 | 2019 |
Human action recognition in unconstrained videos by explicit motion modeling YG Jiang, Q Dai, W Liu, X Xue, CW Ngo IEEE Transactions on Image Processing 24 (11), 3781-3795, 2015 | 87 | 2015 |
Rethinking spatial invariance of convolutional networks for object counting ZQ Cheng, Q Dai, H Li, J Song, X Wu, AG Hauptmann Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 85 | 2022 |
Super fast event recognition in internet videos YG Jiang, Q Dai, T Mei, Y Rui, SF Chang IEEE Transactions on Multimedia 17 (8), 1174-1186, 2015 | 69 | 2015 |
Fast semantic diffusion for large-scale context-based image and video annotation YG Jiang, Q Dai, J Wang, CW Ngo, X Xue, SF Chang IEEE Transactions on Image Processing 21 (6), 3080-3091, 2012 | 62 | 2012 |
Decoupling Localization and Classification in Single Shot Temporal Action Detection Y Huang, Q Dai, Y Lu 2019 IEEE International Conference on Multimedia and Expo (ICME), 2019 | 57 | 2019 |
SVFormer: Semi-supervised video transformer for action recognition Z Xing, Q Dai, H Hu, J Chen, Z Wu, YG Jiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 56 | 2023 |
HiViT: A simpler and more efficient design of hierarchical vision transformer X Zhang, Y Tian, L Xie, W Huang, Q Dai, Q Ye, Q Tian The Eleventh International Conference on Learning Representations, 2023 | 53* | 2023 |
On data scaling in masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, Y Wei, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 41 | 2023 |
Binary optimized hashing Q Dai, J Li, J Wang, YG Jiang Proceedings of the 24th ACM international conference on Multimedia, 1247-1256, 2016 | 38 | 2016 |