Efficient Low-rank Multimodal Fusion with Modality-Specific Factors Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 762 | 2018 |
Words can shift: Dynamically adjusting word representations using nonverbal behaviors Y Wang, Y Shen, Z Liu, PP Liang, A Zadeh, LP Morency Proceedings of the AAAI conference on artificial intelligence 33 (01), 7216-7223, 2019 | 390 | 2019 |
Multiinstruct: Improving multi-modal zero-shot learning via instruction tuning Z Xu, Y Shen, L Huang arXiv preprint arXiv:2212.10773, 2022 | 74 | 2022 |
Efficient low-rank multimodal fusion with modality-specific factors. arXiv 2018 Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency arXiv preprint arXiv:1806.00064, 1806 | 23 | 1806 |
The art of socratic questioning: Recursive thinking with large language models J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang arXiv preprint arXiv:2305.14999, 2023 | 10 | 2023 |
Vision-flan: Scaling human-labeled tasks in visual instruction tuning Z Xu, C Feng, R Shao, T Ashby, Y Shen, D Jin, Y Cheng, Q Wang, ... arXiv preprint arXiv:2402.11690, 2024 | 6 | 2024 |
X-eval: Generalizable multi-aspect text evaluation via augmented instruction tuning with auxiliary evaluation aspects M Liu, Y Shen, Z Xu, Y Cao, E Cho, V Kumar, R Ghanadan, L Huang arXiv preprint arXiv:2311.08788, 2023 | 5 | 2023 |
The art of SOCRATIC QUESTIONING: zero-shot multimodal reasoning with recursive thinking and selfquestioning J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang CoRR, abs/2305.14999, 2023 | 5 | 2023 |
MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks J Qi, M Liu, Y Shen, Z Xu, L Huang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18888 …, 2024 | 1 | 2024 |
Multimodal Instruction Tuning with Conditional Mixture of LoRA Y Shen, Z Xu, Q Wang, Y Cheng, W Yin, L Huang arXiv preprint arXiv:2402.15896, 2024 | 1 | 2024 |
Learning by asking for embodied visual navigation and task completion Y Shen, I Lourentzou arXiv preprint arXiv:2302.04865, 2023 | 1 | 2023 |
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling J Gu, Y Shen, S Zhai, Y Zhang, N Jaitly, JM Susskind arXiv preprint arXiv:2405.21048, 2024 | | 2024 |
Many-to-many Image Generation with Auto-regressive Diffusion Models Y Shen, Y Zhang, S Zhai, L Huang, JM Susskind, J Gu arXiv preprint arXiv:2404.03109, 2024 | | 2024 |
KnowledgeBot: Improving Assistive Robot for Task Completion and Live Interaction via Neuro-Symbolic Reasoning M Liu, Y Shen, BMYS Wang, JQZ Xu, L Huang | | |