关注
Zhaoyang Liu
标题
引用次数
引用次数
年份
Tam: Temporal adaptive module for video recognition
Z Liu, L Wang, W Wu, C Qian, T Lu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
3082021
Teinet: Towards an efficient architecture for video recognition
Z Liu, D Luo, Y Wang, L Wang, Y Tai, C Wang, J Li, F Huang, T Lu
Proceedings of the AAAI conference on artificial intelligence 34 (07), 11669 …, 2020
2442020
Motionbert: A unified perspective on learning human motion representations
W Zhu, X Ma, Z Liu, L Liu, W Wu, Y Wang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
99*2023
Dynamic sampling networks for efficient action recognition in videos
YD Zheng*, Z Liu*, T Lu, L Wang (* denotes equal contribution)
IEEE Transactions on Image Processing 29, 7970-7983, 2020
832020
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ...
arXiv preprint arXiv:2305.05662, 2023
662023
Context-aware attention LSTM network for flood prediction
Z Liu, W Xu, J Feng, S Palaiahnakote, T Lu
2018 24th international conference on pattern recognition (ICPR), 1301-1306, 2018
362018
Joint-modal label denoising for weakly-supervised audio-visual video parsing
H Cheng, Z Liu, H Zhou, C Qian, W Wu, L Wang
European Conference on Computer Vision, 431-448, 2022
232022
Controlllm: Augment language models with tools by searching on graphs
Z Liu, Z Lai, Z Gao, E Cui, Z Li, X Zhu, L Lu, Q Chen, Y Qiao, J Dai, ...
arXiv preprint arXiv:2310.17796, 2023
212023
Data-juicer: A one-stop data processing system for large language models
D Chen, Y Huang, Z Ma, H Chen, X Pan, C Ge, D Gao, Y Xie, Z Liu, J Gao, ...
Companion of the 2024 International Conference on Management of Data, 120-134, 2024
192024
Progressive attention on multi-level dense difference maps for generic event boundary detection
J Tang, Z Liu, C Qian, W Wu, L Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
182022
DeeperForensics Challenge 2020 on real-world face forgery detection: Methods and results
L Jiang, Z Guo, W Wu, Z Liu, Z Liu, CC Loy, S Yang, Y Xiong, W Xia, ...
arXiv preprint arXiv:2102.09471, 2021
122021
LLMs Meet Multimodal Generation and Editing: A Survey
Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...
arXiv preprint arXiv:2405.19334, 2024
42024
Context and temporal aware attention model for flood prediction
Z Liu, Y Wu, Y Ding, J Feng, T Lu
Advances in Multimedia Information Processing–PCM 2018: 19th Pacific-Rim …, 2018
42018
Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation
H Cheng, Z Liu, W Wu, L Wang
International Conference on Learning Representations 2023 (ICLR), 0
3*
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang, Z Chen, X Zhu, L Lu, T Lu, ...
arXiv preprint arXiv:2406.08394, 2024
12024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Z Tian, Z Liu, R Yuan, J Pan, X Huang, Q Liu, X Tan, Q Chen, W Xue, ...
arXiv preprint arXiv:2406.04321, 2024
12024
VLG: General Video Recognition with Web Textual Knowledge
J Lin, Z Liu, W Wang, W Wu, L Wang
International Journal of Computer Vision (IJCV) 2024, 2022
12022
Submission to generic event boundary detection challenge@ cvpr 2022: Local context modeling and global boundary decoding approach
J Tang, Z Liu, J Tan, C Qian, W Wu, L Wang
arXiv preprint arXiv:2206.15268, 2022
12022
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
X Chi, Y Wang, A Cheng, P Fang, Z Tian, Y He, Z Liu, X Qi, J Pan, ...
arXiv preprint arXiv:2407.20962, 2024
2024
A Unified Pretraining Framework for Human Motion Analysis
W Zhu, X Ma, Z Liu, L Liu, W Wu, Y Wang
系统目前无法执行此操作,请稍后再试。
文章 1–20