Towards Perceptual Image Dehazing by Physics-based Disentanglement and Adversarial Training X Yang, Z Xu, J Luo AAAI Conference on Artificial Intelligence (AAAI), 2018 | 247 | 2018 |
Cross-X learning for fine-grained visual categorization W Luo, X Yang, X Mo, Y Lu, LS Davis, J Li, J Yang, SN Lim Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 234 | 2019 |
STEP: Spatio-Temporal Progressive Learning for Video Action Detection X Yang, X Yang, MY Liu, F Xiao, L Davis, J Kautz Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 171 | 2019 |
Deep multimodal representation learning from temporal data X Yang, P Ramesh, R Chitta, S Madhvanath, EA Bernal, J Luo Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 124 | 2017 |
Yang X Yang Yu, MD, 2018 | 97 | 2018 |
ASM-Loc: Action-aware segment modeling for weakly-supervised temporal action localization B He, X Yang, L Kang, Z Cheng, X Zhou, A Shrivastava Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 82 | 2022 |
Tracking Illicit Drug Dealing and Abuse on Instagram Using Multimodal Analysis X Yang, J Luo ACM Transactions on Intelligent Systems and Technology (TIST) 8 (4), 2017 | 75 | 2017 |
Deep temporal multimodal fusion for medical procedure monitoring using wearable sensors EA Bernal, X Yang, Q Li, J Kumar, S Madhvanath, P Ramesh, R Bala IEEE Transactions on Multimedia 20 (1), 107-118, 2017 | 72 | 2017 |
Efficient video transformers with spatial-temporal token selection J Wang, X Yang, H Li, L Liu, Z Wu, YG Jiang European Conference on Computer Vision, 69-86, 2022 | 51 | 2022 |
Understanding the variational lower bound X Yang variational lower bound, ELBO, hard attention 22, 1-4, 2017 | 39 | 2017 |
Semi-supervised vision transformers Z Weng, X Yang, A Li, Z Wu, YG Jiang European conference on computer vision, 605-620, 2022 | 38 | 2022 |
Temporal fusion of multimodal data from multiple data acquisition systems to automatically recognize and classify an action X Yang, EA Bernal, S Madhvanath, R Bala, PS Ramesh, Q Li, J Kumar US Patent 9,805,255, 2017 | 36 | 2017 |
Pinterest board recommendation for Twitter users X Yang, Y Li, J Luo Proceedings of the 23rd ACM international conference on Multimedia, 963-966, 2015 | 35 | 2015 |
Ego-Exo4D: Understanding skilled human activity from first- and third-person perspectives K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 34 | 2024 |
GTA: Global temporal attention for video action understanding B He, X Yang, Z Wu, H Chen, SN Lim, A Shrivastava arXiv preprint arXiv:2012.08510, 2020 | 28 | 2020 |
Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization Z Xu, X Yang, X Li, X Sun, P Harbin BMVC 2 (3), 5, 2018 | 24 | 2018 |
Open-VCLIP: Transforming CLIP to an open-vocabulary video model via interpolated weight optimization Z Weng, X Yang, A Li, Z Wu, YG Jiang International Conference on Machine Learning, 36978-36989, 2023 | 22 | 2023 |
Towards scalable neural representation for diverse videos B He, X Yang, H Wang, Z Wu, H Chen, S Huang, Y Ren, SN Lim, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 20 | 2023 |
Iterative spatio-temporal action detection in video X Yang, X Yang, X Fanyi, MY Liu, J Kautz US Patent 11,017,556, 2021 | 20 | 2021 |
Beyond short clips: End-to-end video-level learning with collaborative memories X Yang, H Fan, L Torresani, LS Davis, H Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 19 | 2021 |