Vision Transformer with Deformable Attention Z Xia*, X Pan*, S Song, LE Li, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2022 | 383 | 2022 |
3D Object Detection with Pointformer X Pan, Z Xia, S Song, LE Li, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021, 2021 | 345 | 2021 |
On the Integration of Self-Attention and Convolution X Pan, C Ge, R Lu, S Song, G Chen, Z Huang, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2022 | 314 | 2022 |
Implicit Semantic Data Augmentation for Deep Networks Y Wang*, X Pan*, S Song, H Zhang, C Wu, G Huang Advances in Neural Information Processing Systems (NeurIPS) 2019, 2019 | 193 | 2019 |
Regularizing Deep Networks with Semantic Data Augmentation Y Wang, G Huang, S Song, X Pan, Y Xia, C Wu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 | 143 | 2021 |
FLatten Transformer: Vision Transformer using Focused Linear Attention D Han*, X Pan*, Y Han, S Song, G Huang International Conference on Computer Vision (ICCV) 2023, 2023 | 59 | 2023 |
ActiveNeRF: Learning where to See with Uncertainty Estimation X Pan, Z Lai, S Song, G Huang European Conference on Computer Vision (ECCV) 2022, 2022 | 59 | 2022 |
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention X Pan, T Ye, Z Xia, S Song, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023 | 31 | 2023 |
Contrastive Language-Image Pre-Training with Knowledge Graphs X Pan, T Ye, D Han, S Song, G Huang Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022 | 29 | 2022 |
Dynamic Perceiver for Efficient Visual Recognition Y Han, D Han, Z Liu, Y Wang, X Pan, Y Pu, C Deng, J Feng, S Song, ... International Conference on Computer Vision (ICCV) 2023, 2023 | 23 | 2023 |
A Unified Framework for Convolution-based Graph Neural Networks X Pan, S Song, G Huang URL https://openreview. net/forum, 2021 | 20* | 2021 |
Joint Representation Learning for Text and 3D Point Cloud R Huang*, X Pan*, H Zheng, H Jiang, Z Xie, S Song, G Huang arXiv preprint arXiv:2301.07584, 2023 | 8 | 2023 |
Gsva: Generalized segmentation via multimodal large language models Z Xia, D Han, Y Han, X Pan, S Song, G Huang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 6 | 2024 |
Budgeted Training for Vision Transformer Z Xia*, X Pan*, X Jin, Y He, S Song, G Huang International Conference on Learning Representations (ICLR) 2023, 2023 | 6* | 2023 |
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention Z Xia, X Pan, S Song, LE Li, G Huang arXiv preprint arXiv:2309.01430, 2023 | 4 | 2023 |
PLAM: A Plug-in Module for Flexible Graph Attention Learning X Pan, S Song, Y Chen, L Wang, G Huang Neurocomputing 480, 76-88, 2022 | 2 | 2022 |