Celeba-spoof: Large-scale face anti-spoofing dataset with rich annotations Y Zhang, ZF Yin, Y Li, G Yin, J Yan, J Shao, Z Liu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 174 | 2020 |
Lamm: Language-assisted multi-modal instruction-tuning dataset, framework, and benchmark Z Yin, J Wang, J Cao, Z Shi, D Liu, M Li, X Huang, Z Wang, L Sheng, L Bai, ... Advances in Neural Information Processing Systems 36, 2024 | 68 | 2024 |
INTERN: A New Learning Paradigm Towards General Vision J Shao, S Chen, Y Li, K Wang, Z Yin, Y He, J Teng, Q Sun, M Gao, J Liu, ... arXiv preprint arXiv:2111.08687, 2021 | 31 | 2021 |
Benchmarking omni-vision representation through the lens of visual realms Y Zhang, Z Yin, J Shao, Z Liu European Conference on Computer Vision, 594-611, 2022 | 19 | 2022 |
Few-Shot Domain Expansion for Face Anti-Spoofing B Yang, J Zhang, Z Yin, J Shao arXiv preprint arXiv:2106.14162, 2021 | 17 | 2021 |
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy Y Zhang, Q Sun, Y Zhou, Z He, Z Yin, K Wang, L Sheng, Y Qiao, J Shao, ... arXiv preprint arXiv:2203.07845, 2022 | 14 | 2022 |
CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results Y Zhang, Z Yin, J Shao, Z Liu, S Yang, Y Xiong, W Xia, Y Xu, M Luo, J Liu, ... arXiv preprint arXiv:2102.12642, 2021 | 14 | 2021 |
Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models Z You, Z Li, J Gu, Z Yin, T Xue, C Dong arXiv preprint arXiv:2312.08962, 2023 | 8 | 2023 |
Octavius: Mitigating Task Interference in MLLMs via MoE Z Chen, Z Wang, Z Wang, H Liu, Z Yin, S Liu, L Sheng, W Ouyang, Y Qiao, ... arXiv preprint arXiv:2311.02684, 2023 | 8 | 2023 |
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities C Lu, C Qian, G Zheng, H Fan, H Gao, J Zhang, J Shao, J Deng, J Fu, ... arXiv preprint arXiv:2401.15071, 2024 | 7 | 2024 |
3D Point Cloud Pre-training with Knowledge Distillation from 2D Images Y Yao, Y Zhang, Z Yin, J Luo, W Ouyang, X Huang arXiv preprint arXiv:2212.08974, 2022 | 7 | 2022 |
X-learner: Learning cross sources and tasks for universal visual representation Y He, G Huang, S Chen, J Teng, K Wang, Z Yin, L Sheng, Z Liu, Y Qiao, ... European Conference on Computer Vision, 509-528, 2022 | 6 | 2022 |
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception Y Qin, E Zhou, Q Liu, Z Yin, L Sheng, R Zhang, Y Qiao, J Shao arXiv preprint arXiv:2312.07472, 2023 | 5 | 2023 |
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models Z Shi, Z Wang, H Fan, Z Yin, L Sheng, Y Qiao, J Shao arXiv preprint arXiv:2311.02692, 2023 | 5 | 2023 |
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models D Liu, X Huang, Y Hou, Z Wang, Z Yin, Y Gong, P Gao, W Ouyang arXiv preprint arXiv:2402.03327, 2024 | 4 | 2024 |
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control E Zhou, Y Qin, Z Yin, Y Huang, R Zhang, L Sheng, Y Qiao, J Shao arXiv preprint arXiv:2403.12037, 2024 | 3 | 2024 |
Robust Face Anti-Spoofing with Dual Probabilistic Modeling Y Zhang, Y Wu, Z Yin, J Shao, Z Liu arXiv preprint arXiv:2204.12685, 2022 | 3 | 2022 |
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models C Qian, J Zhang, W Yao, D Liu, Z Yin, Y Qiao, Y Liu, J Shao arXiv preprint arXiv:2402.19465, 2024 | 2 | 2024 |
Methods, apparatuses, devices, storage media and program products for determining performance parameters Y Zhang, Z Yin, YIN Guojun, J Shao US Patent App. 17/740,968, 2022 | 1 | 2022 |
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Z Chen, Z Shi, X Lu, L He, S Qian, HS Fang, Z Yin, W Ouyang, J Shao, ... arXiv preprint arXiv:2403.19622, 2024 | | 2024 |