Longlora: Efficient fine-tuning of long-context large language models Y Chen, S Qian, H Tang, X Lai, Z Liu, S Han, J Jia arXiv preprint arXiv:2309.12307, 2023 | 104 | 2023 |
On efficient transformer-based image pre-training for low-level vision W Li, X Lu, S Qian, J Lu, X Zhang, J Jia arXiv preprint arXiv:2112.10175, 2021 | 102 | 2021 |
Aggregation via separation: Boosting facial landmark detector with semi-supervised style translation S Qian, K Sun, W Wu, C Qian, J Jia Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 94 | 2019 |
Temporal Interlacing Network H Shao, S Qian, Y Liu AAAI Conference on Artificial Intelligence 2020, 2020 | 81 | 2020 |
Make a face: Towards arbitrary high fidelity face manipulation S Qian, KY Lin, W Wu, Y Liu, Q Wang, F Shen, C Qian, R He Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 78 | 2019 |
Blending anti-aliasing into vision transformer S Qian, H Shao, Y Zhu, M Li, J Jia Advances in Neural Information Processing Systems 34, 5416-5429, 2021 | 17 | 2021 |
What makes for good tokenizers in vision transformer? S Qian, Y Zhu, W Li, M Li, J Jia IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 | 9 | 2022 |
Tagclip: Improving discrimination ability of open-vocabulary semantic segmentation J Li, P Chen, S Qian, J Jia arXiv preprint arXiv:2304.07547, 2023 | 8 | 2023 |
StraIT: Non-autoregressive Generation with Stratified Image Transformer S Qian, H Chang, Y Li, Z Zhang, J Jia, H Zhang arXiv preprint arXiv:2303.00750, 2023 | 5 | 2023 |
Visual cot: Unleashing chain-of-thought reasoning in multi-modal language models H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li arXiv preprint arXiv:2403.16999, 2024 | 3 | 2024 |
Extending the Capacity of CVAE for Face Synthesis and Modeling S Qian, W Wu, Y Liu, B Zhu, F Shen NeurIPS 2018 Workshop on Relational Representation Learning, 2018 | 2 | 2018 |
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation X He, Q Liu, S Qian, X Wang, T Hu, K Cao, K Yan, M Zhou, J Zhang arXiv preprint arXiv:2404.15275, 2024 | 1 | 2024 |
Prompt Highlighter: Interactive Control for Multi-Modal LLMs Y Zhang, S Qian, B Peng, S Liu, J Jia Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |