Lamm: Language-assisted multi-modal instruction-tuning dataset, framework, and benchmark Z Yin, J Wang, J Cao, Z Shi, D Liu, M Li, X Huang, Z Wang, L Sheng, L Bai, ... Advances in Neural Information Processing Systems 36, 2024 | 100 | 2024 |
Uni3d-llm: Unifying point cloud perception, generation and editing with large language models D Liu, X Huang, Y Hou, Z Wang, Z Yin, Y Gong, P Gao, W Ouyang arXiv preprint arXiv:2402.03327, 2024 | 6 | 2024 |
3daxiesprompts: Unleashing the 3d spatial task capabilities of gpt-4v D Liu, X Dong, R Zhang, X Luo, P Gao, X Huang, Y Gong, Z Wang arXiv preprint arXiv:2312.09738, 2023 | 5 | 2023 |
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT L Zhuo, R Du, H Xiao, Y Li, D Liu, R Huang, W Liu, L Zhao, FY Wang, ... arXiv preprint arXiv:2406.18583, 2024 | | 2024 |