AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition S Chen, C Ge, Z Tong, J Wang, Y Song, J Wang, P Luo Conference on Neural Information Processing Systems (NeurIPS), 2022 | 330 | 2022 |
DiffusionDet: Diffusion Model for Object Detection S Chen, P Sun, Y Song, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 301 | 2023 |
CycleMLP: A MLP-like Architecture for Dense Visual Predictions S Chen, E Xie, C Ge, R Chen, D Liang, P Luo IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 256* | 2023 |
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest S Zhang, P Sun, S Chen, M Xiao, W Shao, W Zhang, K Chen, P Luo arXiv preprint arXiv:2307.03601, 2023 | 108 | 2023 |
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Y Yang, Q Li, ... arXiv preprint arXiv:2305.05662, 2023 | 60 | 2023 |
Watch only once: An end-to-end video action detection framework S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 54 | 2021 |
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Y Cong, M Xu, C Simon, S Chen, J Ren, Y Xie, JM Perez-Rua, ... International Conference on Learning Representations (ICLR), 2024, 2023 | 25 | 2023 |
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning C Ge, J Wang, Z Tong, S Chen, Y Song, P Luo International Conference on Learning Representation (ICLR), 2023 | 22 | 2023 |
Efficient differentiable neural architecture search with meta kernels S Chen, Y Chen, S Yan, J Feng arXiv preprint arXiv:1912.04749, 2019 | 19 | 2019 |
Going Denser with Open-Vocabulary Part Segmentation P Sun, S Chen, C Zhu, F Xiao, P Luo, S Xie, Z Yan Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 18 | 2023 |
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer Y Mu, S Chen, M Ding, J Chen, R Chen, P Luo International Conference on Machine Learning, 16043-16061, 2022 | 14 | 2022 |
GenTron: Diffusion Transformers for Image and Video Generation S Chen, M Xu, J Ren, Y Cong, S He, Y Xie, A Sinha, P Luo, T Xiang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 13* | 2024 |
Towards High-Quality Temporal Action Detection with Sparse Proposals J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo arXiv preprint arXiv:2109.08847, 2021 | 9 | 2021 |
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation P Sun, Y Jiang, S Chen, S Zhang, B Peng, P Luo, Z Yuan arXiv preprint arXiv:2406.06525, 2024 | 1 | 2024 |
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis Y Mu, J Chen, Q Zhang, S Chen, Q Yu, C Ge, R Chen, Z Liang, M Hu, ... arXiv preprint arXiv:2402.16117, 2024 | 1 | 2024 |
Enhancing Your Trained DETRs with Box Refinement Y Chen, Q Chen, P Sun, S Chen, J Wang, J Cheng arXiv preprint arXiv:2307.11828, 2023 | 1 | 2023 |