Evo-vit: Slow-fast token evolution for dynamic vision transformer Y Xu, Z Zhang, M Zhang, K Sheng, K Li, W Dong, L Zhang, C Xu, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 2964-2972, 2022 | 135 | 2022 |
Transformers in computational visual media: A survey Y Xu, H Wei, M Lin, Y Deng, K Sheng, M Zhang, F Tang, W Dong, ... Computational Visual Media 8, 33-62, 2022 | 106 | 2022 |
Multi-modal queried object detection in the wild Y Xu, M Zhang, C Fu, P Chen, X Yang, K Li, C Xu Advances in Neural Information Processing Systems 36, 2024 | 13 | 2024 |
Spike-driven transformer v2: Meta spiking neural network architecture inspiring the design of next-generation neuromorphic chips M Yao, J Hu, T Hu, Y Xu, Z Zhou, Y Tian, B Xu, G Li arXiv preprint arXiv:2404.03663, 2024 | 12 | 2024 |
Towards corruption-agnostic robust domain adaptation Y Xu, K Sheng, W Dong, B Wu, C Xu, BG Hu ACM Transactions on Multimedia Computing, Communications, and Applications …, 2022 | 7 | 2022 |
Exploring multi-modal contextual knowledge for open-vocabulary object detection Y Xu, M Zhang, X Yang, C Xu arXiv preprint arXiv:2308.15846, 2023 | 4 | 2023 |
Libra: Building Decoupled Vision System on Large Language Models Y Xu, X Yang, Y Song, C Xu arXiv preprint arXiv:2405.10140, 2024 | | 2024 |