PixArt-: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis J Chen, J Yu, C Ge, L Yao, E Xie, Y Wu, Z Wang, J Kwok, P Luo, H Lu, Z Li ICLR 2024 Spotlight, 2023 | 106 | 2023 |
Deepaccident: A motion and accident prediction benchmark for v2x autonomous driving T Wang, S Kim, J Wenxuan, E Xie, C Ge, J Chen, Z Li, P Luo Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5599-5606, 2024 | 29 | 2024 |
Metabev: Solving sensor failures for 3d detection and map segmentation C Ge, J Chen, E Xie, Z Wang, L Hong, H Lu, Z Li, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 29* | 2023 |
Pixart-\sigma: Weak-to-strong training of diffusion transformer for 4k text-to-image generation J Chen, C Ge, E Xie, Y Wu, L Yao, X Ren, Z Wang, P Luo, H Lu, Z Li ECCV 2024, 2024 | 20 | 2024 |
A survey of reasoning with foundation models J Sun, C Zheng, E Xie, Z Liu, R Chu, J Qiu, J Xu, M Ding, H Li, M Geng, ... arXiv preprint arXiv:2312.11562, 2023 | 18 | 2023 |
Arkittrack: a new diverse dataset for tracking using mobile RGB-D data H Zhao, J Chen, L Wang, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 12 | 2023 |
Pixart-{\delta}: Fast and controllable image generation with latent consistency models J Chen, Y Wu, S Luo, E Xie, S Paul, P Luo, H Zhao, Z Li ICML 2024 workshop, 2024 | 6 | 2024 |
Fast training of diffusion transformer with extreme masking for 3d point clouds generation S Mo, E Xie, Y Wu, J Chen, M Nießner, Z Li ECCV 2024, 2023 | 3 | 2023 |
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers P Gao, L Zhuo, Z Lin, C Liu, J Chen, R Du, E Xie, X Luo, L Qiu, Y Zhang, ... arXiv preprint arXiv:2405.05945, 2024 | 1 | 2024 |