MVSTER: Epipolar transformer for efficient multi-view stereo X Wang, Z Zhu, G Huang, F Qin, Y Ye, Y He, X Chi, X Wang European Conference on Computer Vision, 573-591, 2022 | 77 | 2022 |
OpenOccupancy: A large scale benchmark for surrounding semantic occupancy perception X Wang, Z Zhu, W Xu, Y Zhang, Y Wei, X Chi, Y Ye, D Du, J Lu, X Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 71 | 2023 |
DriveDreamer: Towards real-world-driven world models for autonomous driving X Wang, Z Zhu, G Huang, X Chen, J Lu arXiv preprint arXiv:2309.09777, 2023 | 49 | 2023 |
On the road with GPT-4V(ision): Early explorations of visual-language model on autonomous driving BS Licheng Wen, Xuemeng Yang, Daocheng Fu, Xiaofeng Wang, Pinlong Cai, Xin ... The Thirteenth International Conference on Learning Representations Workshop …, 2024 | 35* | 2024 |
Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion WZ B Li, Y Sun, Z Liang, D Du, Z Zhang, X Wang, Y Wang, X Jin Proceedings of the Thirty-Third International Joint Conference on Artificial …, 2024 | 12* | 2024 |
Are we ready for vision-centric driving streaming perception? The ASAP benchmark X Wang, Z Zhu, Y Zhang, G Huang, Y Ye, W Xu, Z Chen, X Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 11 | 2023 |
Crafting monocular cues and velocity guidance for self-supervised multi-frame depth learning X Wang, Z Zhu, G Huang, X Chi, Y Ye, Z Chen, X Wang Proceedings of the AAAI Conference on Artificial Intelligence 37 (3), 2689-2697, 2023 | 8 | 2023 |
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens X Wang, Z Zhu, G Huang, B Wang, X Chen, J Lu arXiv preprint arXiv:2401.09985, 2024 | 6 | 2024 |
DriveDreamer-2: LLM-enhanced world models for diverse driving video generation G Zhao, X Wang, Z Zhu, X Chen, G Huang, X Bao, X Wang arXiv preprint arXiv:2403.06845, 2024 | 4 | 2024 |
LiftedCL: Lifting contrastive learning for human-centric perception Z Chen, Q Li, X Wang, W Yang The Eleventh International Conference on Learning Representations, 2022 | 4 | 2022 |
Is Sora a world simulator? A comprehensive survey on general world models and beyond Z Zhu, X Wang, W Zhao, C Min, N Deng, M Dou, Y Wang, B Shi, K Wang, ... arXiv preprint arXiv:2405.03520, 2024 | 3 | 2024 |
A Multimodal Neural Network for Contact State Recognition During Probe Implantation into Skull Holes Y Song, X Wang, D Zhang 2023 IEEE 19th International Conference on Automation Science and …, 2023 | | 2023 |