Title | Authors | Venue | Cited by | Year |
Adding Conditional Control to Text-to-Image Diffusion Models | L Zhang, A Rao, M Agrawala | Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 1811 | 2023 |
HotFlip: White-Box Adversarial Examples for Text Classification | J Ebrahimi, A Rao, D Lowd, D Dou | Proceedings of Annual Meeting of the Association for Computational Linguistics, 2018 | 1114 | 2018 |
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao, M Agrawala, D Lin, ... | International Conference on Learning Representations, 2024 | 224 | 2024 |
MovieNet: A Holistic Dataset for Movie Understanding | Q Huang, Y Xiong, A Rao, J Wang, D Lin | European Conference on Computer Vision, 2020 | 202 | 2020 |
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering | Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin | European Conference on Computer Vision, 2022 | 162 | 2022 |
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation | A Rao, L Xu, Y Xiong, G Xu, Q Huang, B Zhou, D Lin | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 134 | 2020 |
A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language | B Su, D Du, Z Yang, Y Zhou, J Li, A Rao, H Sun, Z Lu, JR Wen | arXiv preprint arXiv:2209.05481, 2022 | 72* | 2022 |
A Unified Framework for Shot Type Classification Based on Subject Centric Lens | A Rao, J Wang, L Xu, X Jiang, Q Huang, B Zhou, D Lin | European Conference on Computer Vision, 2020 | 66 | 2020 |
CityNeRF: Building NeRF at City Scale | Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin | arXiv preprint arXiv:2112.05504, 2021 | 47 | 2021 |
Online Multi-modal Person Search in Videos | J Xia, A Rao*, Q Huang, L Xu, J Wen, D Lin | European Conference on Computer Vision, 2020 | 31 | 2020 |
White-Box Adversarial Examples for NLP | J Ebrahimi, A Rao, D Lowd, D Dou | arXiv preprint arXiv:1712.06751, 2017 | 19* | 2017 |
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Y Guo, C Yang, A Rao, M Agrawala, D Lin, B Dai | arXiv preprint arXiv:2311.16933, 2023 | 18 | 2023 |
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences | Y Zhou, H Duan, A Rao, B Su, J Wang | Proceedings of the AAAI Conference on Artificial Intelligence, 2023 | 17 | 2023 |
ControlNet | L Zhang, A Rao, M Agrawala | | 12* | 2023 |
Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos | X Jiang, L Jin, A Rao*, L Xu, D Lin | IEEE Transactions on Multimedia, 2021 | 10 | 2021 |
Computer Vision–ECCV 2022 Workshops: Tel Aviv, Israel, October 23–27, 2022, Proceedings, AI for Creative Video Editing and Understanding | A Rao, F Caba, D Liu, A Pardo, Y Xiong, V Escorcia, A Thabet, B Ghanem, ... | Springer Nature, 2023 | 8* | 2023 |
Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production | A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin, D Lin, B Dai | ACM SIGGRAPH Special Interest Group on Computer Graphics and Interactive …, 2023 | 8 | 2023 |
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation | X Liu, X Xu, A Rao, C Gan, L Yi | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 8 | 2022 |
BlockPlanner: City Block Generation With Vectorized Graph Representation | L Xu, Y Xiangli, A Rao, N Zhao, B Dai, Z Liu, D Lin | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 7 | 2021 |
A Coarse-to-Fine Framework for Automatic Video Unscreen | A Rao, L Xu, Z Li, Q Huang, Z Kuang, W Zhang, D Lin | IEEE Transactions on Multimedia, 2022 | 6 | 2022 |