关注
Shoufa Chen
Shoufa Chen
在 connect.hku.hk 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
S Chen, C Ge, Z Tong, J Wang, Y Song, J Wang, P Luo
Conference on Neural Information Processing Systems (NeurIPS), 2022
3302022
DiffusionDet: Diffusion Model for Object Detection
S Chen, P Sun, Y Song, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
3012023
CycleMLP: A MLP-like Architecture for Dense Visual Predictions
S Chen, E Xie, C Ge, R Chen, D Liang, P Luo
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
256*2023
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
S Zhang, P Sun, S Chen, M Xiao, W Shao, W Zhang, K Chen, P Luo
arXiv preprint arXiv:2307.03601, 2023
1082023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Y Yang, Q Li, ...
arXiv preprint arXiv:2305.05662, 2023
602023
Watch only once: An end-to-end video action detection framework
S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
542021
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Y Cong, M Xu, C Simon, S Chen, J Ren, Y Xie, JM Perez-Rua, ...
International Conference on Learning Representations (ICLR), 2024, 2023
252023
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning
C Ge, J Wang, Z Tong, S Chen, Y Song, P Luo
International Conference on Learning Representation (ICLR), 2023
222023
Efficient differentiable neural architecture search with meta kernels
S Chen, Y Chen, S Yan, J Feng
arXiv preprint arXiv:1912.04749, 2019
192019
Going Denser with Open-Vocabulary Part Segmentation
P Sun, S Chen, C Zhu, F Xiao, P Luo, S Xie, Z Yan
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
182023
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
Y Mu, S Chen, M Ding, J Chen, R Chen, P Luo
International Conference on Machine Learning, 16043-16061, 2022
142022
GenTron: Diffusion Transformers for Image and Video Generation
S Chen, M Xu, J Ren, Y Cong, S He, Y Xie, A Sinha, P Luo, T Xiang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
13*2024
Towards High-Quality Temporal Action Detection with Sparse Proposals
J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo
arXiv preprint arXiv:2109.08847, 2021
92021
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
P Sun, Y Jiang, S Chen, S Zhang, B Peng, P Luo, Z Yuan
arXiv preprint arXiv:2406.06525, 2024
12024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Y Mu, J Chen, Q Zhang, S Chen, Q Yu, C Ge, R Chen, Z Liang, M Hu, ...
arXiv preprint arXiv:2402.16117, 2024
12024
Enhancing Your Trained DETRs with Box Refinement
Y Chen, Q Chen, P Sun, S Chen, J Wang, J Cheng
arXiv preprint arXiv:2307.11828, 2023
12023
系统目前无法执行此操作,请稍后再试。
文章 1–16