Deep adversarial metric learning for cross-modal retrieval X Xu, L He, H Lu, L Gao, Y Ji World Wide Web 22, 657-672, 2019 | 215 | 2019 |
Video captioning by adversarial LSTM Y Yang, J Zhou, J Ai, Y Bin, A Hanjalic, HT Shen, Y Ji IEEE Transactions on Image Processing 27 (11), 5600-5611, 2018 | 215 | 2018 |
Interactive body part contrast mining for human interaction recognition Y Ji, G Ye, H Cheng 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 1-6, 2014 | 163 | 2014 |
Universal weighting metric learning for cross-modal matching J Wei, X Xu, Y Yang, Y Ji, Z Wang, HT Shen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 98 | 2020 |
More is better: Precise and detailed image captioning using online positive recall and missing concepts mining M Zhang, Y Yang, H Zhang, Y Ji, HT Shen, TS Chua IEEE Transactions on Image Processing 28 (1), 32-44, 2018 | 81 | 2018 |
Cross-domain facial expression recognition via an intra-category common feature and inter-category distinction feature fusion network Y Ji, Y Hu, Y Yang, F Shen, HT Shen Neurocomputing 333, 231-239, 2019 | 79 | 2019 |
Multi-stage aggregated transformer network for temporal language localization in videos M Zhang, Y Yang, X Chen, Y Ji, X Xu, J Li, HT Shen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 75 | 2021 |
A large-scale RGB-D database for arbitrary-view human action recognition Y Ji, F Xu, Y Yang, F Shen, HT Shen, WS Zheng Proceedings of the 26th ACM International Conference on Multimedia, 1510-1518, 2018 | 72 | 2018 |
A survey of human action analysis in HRI applications Y Ji, Y Yang, F Shen, HT Shen, X Li IEEE Transactions on Circuits and Systems for Video Technology 30 (7), 2114-2128, 2019 | 71 | 2019 |
Learning contrastive feature distribution model for interaction recognition Y Ji, H Cheng, Y Zheng, H Li Journal of Visual Communication and Image Representation 33, 340-349, 2015 | 58 | 2015 |
Arbitrary-view human action recognition via novel-view action generation K Gedamu, Y Ji, Y Yang, LL Gao, HT Shen Pattern Recognition 118, 108043, 2021 | 47 | 2021 |
A context knowledge map guided coarse-to-fine action recognition Y Ji, Y Zhan, Y Yang, X Xu, F Shen, HT Shen IEEE Transactions on Image Processing 29, 2742-2752, 2019 | 41 | 2019 |
Recognition and detection of two-person interactive actions using automatically selected skeleton features H Wu, J Shao, X Xu, Y Ji, F Shen, HT Shen IEEE Transactions on Human-Machine Systems 48 (3), 304-310, 2017 | 40 | 2017 |
Gazing point dependent eye gaze estimation H Cheng, Y Liu, W Fu, Y Ji, L Yang, Y Zhao, J Yang Pattern Recognition 71, 36-44, 2017 | 38 | 2017 |
Partial feature selection and alignment for multi-source domain adaptation Y Fu, M Zhang, X Xu, Z Cao, C Ma, Y Ji, K Zuo, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 36 | 2021 |
Answer again: Improving VQA with cascaded-answering model L Peng, Y Yang, X Zhang, Y Ji, H Lu, HT Shen IEEE Transactions on Knowledge and Data Engineering 34 (4), 1644-1655, 2020 | 34 | 2020 |
Arbitrary-view human action recognition: A varying-view RGB-D action dataset Y Ji, Y Yang, F Shen, HT Shen, WS Zheng IEEE Transactions on Circuits and Systems for Video Technology 31 (1), 289-300, 2020 | 34 | 2020 |
Word-to-region attention network for visual question answering L Peng, Y Yang, Y Bin, N Xie, F Shen, Y Ji, X Xu Multimedia Tools and Applications 78, 3843-3858, 2019 | 32 | 2019 |
One-shot learning based pattern transition map for action early recognition Y Ji, Y Yang, X Xu, HT Shen Signal Processing 143, 364-370, 2018 | 32 | 2018 |
Cross-modal dynamic networks for video moment retrieval with text query G Wang, X Xu, F Shen, H Lu, Y Ji, HT Shen IEEE Transactions on Multimedia 24, 1221-1232, 2022 | 31 | 2022 |