What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Y Zeng, H Zhang, J Zheng, J Xia, G Wei, Y Wei, Y Zhang, T Kong, R Song Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | 64 | 2024 |
Make pixels dance: High-dynamic video generation Y Zeng, G Wei, J Zheng, J Zou, Y Wei, Y Zhang, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 46 | 2024 |