M3: Multimodal memory modelling for video captioning J Wang, W Wang, Y Huang, L Wang, T Tan Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 187 | 2018 |
Learning visual relationship and context-aware attention for image captioning J Wang, W Wang, L Wang, Z Wang, DD Feng, T Tan Pattern Recognition 98, 107075, 2020 | 130 | 2020 |
Pose-guided multi-granularity attention network for text-based person search Y Jing, C Si, J Wang, W Wang, L Wang, T Tan Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 11189 …, 2020 | 122 | 2020 |
Stacked memory network for video summarization J Wang, W Wang, Z Wang, L Wang, D Feng, T Tan Proceedings of the 27th ACM international conference on multimedia, 836-844, 2019 | 65 | 2019 |
Hierarchical memory modelling for video captioning J Wang, W Wang, Y Huang, L Wang, T Tan Proceedings of the 26th ACM international conference on Multimedia, 63-71, 2018 | 19 | 2018 |
Relational graph neural network for situation recognition Y Jing, J Wang, W Wang, L Wang, T Tan Pattern Recognition 108, 107544, 2020 | 15 | 2020 |
Cascade attention network for person search: Both image and text-image similarity selection Y Jing, C Si, J Wang, W Wang, L Wang, T Tan arXiv preprint arXiv:1809.08440 2 (3), 5, 2018 | 13 | 2018 |
Pose-guided joint global and attentive local matching network for text-based person search Y Jing, C Si, J Wang, W Wang, L Wang, T Tan Association for the Advancement of Artificial Intelligence (AAAI), 2020 | 12 | 2020 |
PreNet: Parallel Recurrent Neural Networks for Image Classification J Wang, W Wang, L Wang, T Tan Computer Vision: Second CCF Chinese Conference, CCCV 2017, Tianjin, China …, 2017 | 3 | 2017 |