Tokens-to-token vit: Training vision transformers from scratch on imagenet L Yuan, Y Chen, T Wang, W Yu, Y Shi, ZH Jiang, FEH Tay, J Feng, S Yan Proceedings of the IEEE/CVF international conference on computer vision, 558-567, 2021 | 2050 | 2021 |
DeepViT: Towards Deeper Vision Transformer D Zhou, B Kang, X Jin, L Yang, X Lian, Z Jiang, Q Hou, J Feng arXiv preprint arXiv:2103.11886, 2021 | 563 | 2021 |
Volo: Vision outlooker for visual recognition L Yuan, Q Hou, Z Jiang, J Feng, S Yan IEEE transactions on pattern analysis and machine intelligence 45 (5), 6575-6586, 2022 | 293 | 2022 |
All tokens matter: Token labeling for training better vision transformers ZH Jiang, Q Hou, L Yuan, D Zhou, Y Shi, X Jin, A Wang, J Feng Advances in neural information processing systems 34, 18590-18602, 2021 | 237* | 2021 |
Convbert: Improving bert with span-based dynamic convolution ZH Jiang, W Yu, D Zhou, Y Chen, J Feng, S Yan Advances in Neural Information Processing Systems 33, 12837-12848, 2020 | 185 | 2020 |
Reclor: A reading comprehension dataset requiring logical reasoning W Yu, Z Jiang, Y Dong, J Feng arXiv preprint arXiv:2002.04326, 2020 | 181 | 2020 |
Vision permutator: A permutable mlp-like architecture for visual recognition Q Hou, Z Jiang, L Yuan, MM Cheng, S Yan, J Feng IEEE transactions on pattern analysis and machine intelligence 45 (1), 1328-1334, 2022 | 177 | 2022 |
Joint 3d face reconstruction and dense face alignment from a single image with 2d-assisted self-supervised learning X Tu, J Zhao, Z Jiang, Y Luo, M Xie, Y Zhao, L He, Z Ma, J Feng arXiv preprint arXiv:1903.09359 1 (2), 2019 | 133* | 2019 |
Disentangled representation learning for 3d face shape ZH Jiang, Q Wu, K Chen, J Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 124 | 2019 |
Avatargen: a 3d generative model for animatable human avatars J Zhang, Z Jiang, D Yang, H Xu, Y Shi, G Song, Z Xu, X Wang, J Feng European Conference on Computer Vision, 668-685, 2022 | 73 | 2022 |
Refiner: Refining self-attention for vision transformers D Zhou, Y Shi, B Kang, W Yu, Z Jiang, Y Li, X Jin, Q Hou, J Feng arXiv preprint arXiv:2106.03714, 2021 | 66 | 2021 |
Mimicking the oracle: An initial phase decorrelation approach for class incremental learning Y Shi, K Zhou, J Liang, Z Jiang, J Feng, PHS Torr, S Bai, VYF Tan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 60 | 2022 |
Tm2d: Bimodality driven 3d dance generation via music-text integration K Gong, D Lian, H Chang, C Guo, Z Jiang, X Zuo, MB Mi, X Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 32 | 2023 |
Few-shot classification via adaptive attention Z Jiang, B Kang, K Zhou, J Feng arXiv preprint arXiv:2008.02465, 2020 | 27 | 2020 |
Omniavatar: Geometry-guided controllable 3d head synthesis H Xu, G Song, Z Jiang, J Zhang, Y Shi, J Liu, W Ma, J Feng, L Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 17 | 2023 |
Agilegan3d: Few-shot 3d portrait stylization by augmented transfer learning G Song, H Xu, J Liu, T Zhi, Y Shi, J Zhang, Z Jiang, J Feng, S Sang, L Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training R Wang, Q Yao, H Lai, Z He, X Tao, Z Jiang, SK Zhou arXiv preprint arXiv:2312.13316, 2023 | 3 | 2023 |
Carzero: Cross-attention alignment for radiology zero-shot classification H Lai, Q Yao, Z Jiang, R Wang, Z He, X Tao, SK Zhou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 2 | 2024 |
LV-BERT: Exploiting layer variety for BERT W Yu, Z Jiang, F Chen, Q Hou, J Feng arXiv preprint arXiv:2106.11740, 2021 | 2 | 2021 |
J. Feng et S. Yan,«Tokensto-token ViT: Training vision transformers from scratch on ImageNet» L Yuan, Y Chen, T Wang, W Yu, Y Shi, Z Jiang, FE Tay arXiv preprint arXiv 2101, 19, 2021 | 2 | 2021 |