Mict: Mixed 3d/2d convolutional tube for human action recognition Y Zhou, X Sun, ZJ Zha, W Zeng Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 271 | 2018 |
One-Shot Neural Architecture Search Through A Posteriori Distribution Guided Sampling Y Zhou, X Sun, C Luo, ZJ Zha, W Zeng arXiv preprint arXiv:1906.09557, 2019 | 180* | 2019 |
Context-reinforced semantic segmentation Y Zhou, X Sun, ZJ Zha, W Zeng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 62 | 2019 |
Adaptive pooling in multi-instance learning for web video annotation Y Zhou, X Sun, D Liu, Z Zha, W Zeng Proceedings of the IEEE International Conference on Computer Vision …, 2017 | 54 | 2017 |
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View Y Zhou, X Sun, C Luo, ZJ Zha, W Zeng Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2020 | 28 | 2020 |
Unsupervised visual representation learning by tracking patches in video G Wang, Y Zhou, C Luo, W Xie, W Zeng, Z Xiong Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 21 | 2021 |
Posterior-Guided Neural Architecture Search Y Zhou, X Sun, C Luo, ZJ Zha, W Zeng Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 2020 | 8 | 2020 |
Distribution consistent neural architecture search J Pan, C Sun, Y Zhou, Y Zhang, C Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 6 | 2022 |
Inter-x: Towards versatile human-human interaction analysis L Xu, X Lv, Y Yan, X Jin, S Wu, C Xu, Y Liu, Y Zhou, F Rao, X Sheng, Y Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Image captioning with multi-context synthetic data F Ma, Y Zhou, F Rao, Y Zhang, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4089-4097, 2024 | 2 | 2024 |
VAE^ 2: Preventing posterior collapse of variational video predictions in the wild Y Zhou, C Luo, X Sun, ZJ Zha, W Zeng arXiv preprint arXiv:2101.12050, 2021 | 1 | 2021 |
Visual Perception by Large Language Model's Weights F Ma, H Xue, G Wang, Y Zhou, F Rao, S Yan, Y Zhang, S Wu, MZ Shou, ... arXiv preprint arXiv:2405.20339, 2024 | | 2024 |
Multi-Modal Generative Embedding Model F Ma, H Xue, G Wang, Y Zhou, F Rao, S Yan, Y Zhang, S Wu, MZ Shou, ... arXiv preprint arXiv:2405.19333, 2024 | | 2024 |
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models F Ma, Y Zhou, Y Zhang, S Wu, Z Zhang, Z He, F Rao, X Sun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
ReGenNet: Towards Human Action-Reaction Synthesis L Xu, Y Zhou, Y Yan, X Jin, W Zhu, F Rao, X Yang, W Zeng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
Text-Only Image Captioning with Multi-Context Data Generation. F Ma, Y Zhou, F Rao, Y Zhang, X Sun CoRR, 2023 | | 2023 |
ReGenNet: Towards Human Action-Reaction Synthesis* Appendix L Xu, Y Zhou, Y Yan, X Jin, W Zhu, F Rao, X Yang, W Zeng | | |