Unitr: A unified and efficient multi-modal transformer for bird's-eye-view representation H Wang, H Tang, S Shi, A Li, Z Li, B Schiele, L Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 49 | 2023 |
Git: Towards generalist vision transformer through universal language interface H Wang, H Tang, L Jiang, S Shi, MF Naeem, H Li, B Schiele, L Wang European Conference on Computer Vision, 55-73, 2025 | 3 | 2025 |