Dynamicvit: Efficient vision transformers with dynamic token sparsification Y Rao, W Zhao, B Liu, J Lu, J Zhou, CJ Hsieh Advances in neural information processing systems 34, 13937-13949, 2021 | 569 | 2021 |
Unleashing text-to-image diffusion models for visual perception W Zhao, Y Rao, Z Liu, B Liu, J Zhou, J Lu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 107 | 2023 |
Tifa: Accurate and interpretable text-to-image faithfulness evaluation with question answering Y Hu, B Liu, J Kasai, Y Wang, M Ostendorf, R Krishna, NA Smith Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 91 | 2023 |
Randomrooms: Unsupervised pre-training from synthetic shapes and randomized layouts for 3d object detection Y Rao, B Liu, Y Wei, J Lu, CJ Hsieh, J Zhou Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 56 | 2021 |
MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation B Liu, Y Rao, J Lu, J Zhou, CJ Hsieh ECCV 2020: European Conference on Computer Vision (ECCV 2020), 2020 | 41 | 2020 |
Robust object detection via instance-level temporal cycle confusion X Wang, TE Huang, B Liu, F Yu, X Wang, JE Gonzalez, T Darrell Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 28 | 2021 |
Multi-proxy wasserstein classifier for image classification B Liu, Y Rao, J Lu, J Zhou, CJ Hsieh Proceedings of the AAAI Conference on Artificial Intelligence 35 (10), 8618-8626, 2021 | 9 | 2021 |
An integrated optical neural network chip based on mach-zehnder interferometers X Zhao, Z Yu, B Liu, Y Li, H Chen, M Chen Asia Communications and Photonics Conference, Su2A. 71, 2018 | 8 | 2018 |
Efficient Inference of Vision Instruction-Following Models with Elastic Cache Z Liu, B Liu, J Wang, Y Dong, G Chen, Y Rao, R Krishna, J Lu arXiv preprint arXiv:2407.18121, 2024 | 1 | 2024 |
Matching-based Data Valuation for Generative Model J Yang, W Deng, B Liu, Y Huang, X Li arXiv preprint arXiv:2304.10701, 2023 | 1 | 2023 |
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model B Liu, Y Dong, Y Wang, Y Rao, Y Tang, WC Ma, R Krishna arXiv preprint arXiv:2408.00754, 2024 | | 2024 |