Crossformer++: A versatile vision transformer hinging on cross-scale attention W Wang, W Chen, Q Qiu, L Chen, B Wu, B Lin, X He, W Liu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 225 | 2023 |
Do Wider Neural Networks Really Help Adversarial Robustness? B Wu, J Chen, D Cai, X He, Q Gu Proc. of Advances in Neural Information Processing Systems (NeurIPS) 34, 2021, 2020 | 107* | 2020 |
Clip is also an efficient segmenter: A text-driven approach for weakly supervised semantic segmentation Y Lin, M Chen, W Wang, B Wu, K Li, B Lin, H Liu, X He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 83 | 2023 |
GD-MAE: generative decoder for MAE pre-training on lidar point clouds H Yang, T He, J Liu, H Chen, B Wu, B Lin, X He, W Ouyang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 49 | 2023 |
Towards efficient adversarial training on vision transformers B Wu, J Gu, Z Li, D Cai, X He, W Liu European Conference on Computer Vision, 307-325, 2022 | 41 | 2022 |
One-shot implicit animatable avatars with model-based priors Y Huang, H Yi, W Liu, H Wang, B Wu, W Wang, B Lin, D Zhang, D Cai Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 28 | 2023 |
Attacking adversarial attacks as a defense B Wu, H Pan, L Shen, J Gu, S Zhao, Z Li, D Cai, X He, W Liu arXiv preprint arXiv:2106.04938, 2021 | 28 | 2021 |
Correlation maximized structural similarity loss for semantic segmentation S Zhao, B Wu, W Chu, Y Hu, D Cai arXiv preprint arXiv:1910.08711, 2019 | 26 | 2019 |
Exploring the relationship between architectural design and adversarially robust generalization A Liu, S Tang, S Liang, R Gong, B Wu, X Liu, D Tao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 23* | 2023 |
Weakm3d: Towards weakly supervised monocular 3d object detection L Peng, S Yan, B Wu, Z Yang, X He, D Cai arXiv preprint arXiv:2203.08332, 2022 | 21 | 2022 |
Learning occupancy for monocular 3d object detection L Peng, J Xu, H Cheng, Z Yang, X Wu, W Qian, W Wang, B Wu, D Cai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 7 | 2024 |
Normkd: Normalized logits for knowledge distillation Z Chi, T Zheng, H Li, Z Yang, B Wu, B Lin, D Cai arXiv preprint arXiv:2308.00520, 2023 | 6 | 2023 |
Improving semantic segmentation via dilated affinity B Wu, S Zhao, W Chu, Z Yang, D Cai arXiv preprint arXiv:1907.07011, 2019 | 6 | 2019 |
APPT: Asymmetric parallel point transformer for 3D point cloud understanding H Li, T Zheng, Z Chi, Z Yang, W Wang, B Wu, B Lin, D Cai arXiv preprint arXiv:2303.17815, 2023 | 5 | 2023 |
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Y Yang, W Wang, L Peng, C Song, Y Chen, H Li, X Yang, Q Lu, D Cai, ... arXiv preprint arXiv:2403.11627, 2024 | 2 | 2024 |
Local Conditional Controlling for Text-to-Image Diffusion Models Y Zhao, L Peng, Y Yang, Z Luo, H Li, Y Chen, W Zhao, W Liu, B Wu arXiv preprint arXiv:2312.08768, 2023 | 2 | 2023 |
Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning L Peng, H Cheng, Z Yang, R Zhao, L Xia, C Song, Q Lu, W Liu, B Wu arXiv preprint arXiv:2311.17536, 2023 | 1 | 2023 |
Searching Priors Makes Text-to-Video Synthesis Better H Cheng, L Peng, L Xia, Y Hu, H Li, Q Lu, X He, B Wu arXiv preprint arXiv:2406.03215, 2024 | | 2024 |
Temporal Feature Fusion for 3D Detection in Monocular Video H Cheng, L Peng, Z Yang, B Lin, X He, B Wu IEEE Transactions on Image Processing, 2024 | | 2024 |
Object Detectors in the Open Environment: Challenges, Solutions, and Outlook S Liang, W Wang, R Chen, A Liu, B Wu, EC Chang, X Cao, D Tao arXiv preprint arXiv:2403.16271, 2024 | | 2024 |