Multimedia search reranking: A literature survey

T Mei, Y Rui, S Li, Q Tian - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
The explosive growth and widespread accessibility of community-contributed media content
on the Internet have led to a surge of research activity in multimedia search. Approaches that …

Multimedia data mining: state of the art and challenges

CA Bhatt, MS Kankanhalli - Multimedia Tools and Applications, 2011 - Springer
Advances in multimedia data acquisition and storage technology have led to the growth of
very large multimedia databases. Analyzing this huge amount of multimedia data to discover …

Sequence to sequence-video to text

S Venugopalan, M Rohrbach… - Proceedings of the …, 2015 - openaccess.thecvf.com
Real-world videos often have complex dynamics; methods for generating open-domain
video descriptions should be senstive to temporal structure and allow both input (sequence …

Translating videos to natural language using deep recurrent neural networks

S Venugopalan, H Xu, J Donahue, M Rohrbach… - arXiv preprint arXiv …, 2014 - arxiv.org
Solving the visual symbol grounding problem has long been a goal of artificial intelligence.
The field appears to be advancing closer to this goal with recent breakthroughs in deep …

M3: Multimodal memory modelling for video captioning

J Wang, W Wang, Y Huang… - Proceedings of the …, 2018 - openaccess.thecvf.com
Video captioning which automatically translates video clips into natural language sentences
is a very important task in computer vision. By virtue of recent deep learning technologies …

Deep multi-view feature learning for person re-identification

D Tao, Y Guo, B Yu, J Pang, Z Yu - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Person re-identification aims to identify the same pedestrians across different camera views
at different locations. This important yet difficult intelligent video analysis problem remains a …

Saliency inside: Learning attentive CNNs for content-based image retrieval

S Wei, L Liao, J Li, Q Zheng, F Yang… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In content-based image retrieval (CBIR), one of the most challenging and ambiguous tasks
is to correctly understand the human query intention and measure its semantic relevance …

Using linked data to annotate and search educational video resources for supporting distance learning

HQ Yu, C Pedrinaci, S Dietze… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Multimedia educational resources play an important role in education, particularly for
distance learning environments. With the rapid growth of the multimedia web, large numbers …

Multiview cauchy estimator feature embedding for depth and inertial sensor-based human action recognition

Y Guo, D Tao, W Liu, J Cheng - IEEE Transactions on Systems …, 2016 - ieeexplore.ieee.org
The ever-growing popularity of Kinect and inertial sensors has prompted intensive research
efforts on human action recognition. Since human actions were extracted from Kinect and …

Nonlinear structural hashing for scalable video search

Z Chen, J Lu, J Feng, J Zhou - IEEE Transactions on Circuits …, 2017 - ieeexplore.ieee.org
In this paper, we propose a nonlinear structural hashing approach to learn compact binary
codes for scalable video search. Unlike most existing video hashing methods which …