Multimedia search reranking: A literature survey
The explosive growth and widespread accessibility of community-contributed media content
on the Internet have led to a surge of research activity in multimedia search. Approaches that …
on the Internet have led to a surge of research activity in multimedia search. Approaches that …
Multimedia data mining: state of the art and challenges
CA Bhatt, MS Kankanhalli - Multimedia Tools and Applications, 2011 - Springer
Advances in multimedia data acquisition and storage technology have led to the growth of
very large multimedia databases. Analyzing this huge amount of multimedia data to discover …
very large multimedia databases. Analyzing this huge amount of multimedia data to discover …
Sequence to sequence-video to text
S Venugopalan, M Rohrbach… - Proceedings of the …, 2015 - openaccess.thecvf.com
Real-world videos often have complex dynamics; methods for generating open-domain
video descriptions should be senstive to temporal structure and allow both input (sequence …
video descriptions should be senstive to temporal structure and allow both input (sequence …
Translating videos to natural language using deep recurrent neural networks
Solving the visual symbol grounding problem has long been a goal of artificial intelligence.
The field appears to be advancing closer to this goal with recent breakthroughs in deep …
The field appears to be advancing closer to this goal with recent breakthroughs in deep …
M3: Multimodal memory modelling for video captioning
Video captioning which automatically translates video clips into natural language sentences
is a very important task in computer vision. By virtue of recent deep learning technologies …
is a very important task in computer vision. By virtue of recent deep learning technologies …
Deep multi-view feature learning for person re-identification
Person re-identification aims to identify the same pedestrians across different camera views
at different locations. This important yet difficult intelligent video analysis problem remains a …
at different locations. This important yet difficult intelligent video analysis problem remains a …
Saliency inside: Learning attentive CNNs for content-based image retrieval
In content-based image retrieval (CBIR), one of the most challenging and ambiguous tasks
is to correctly understand the human query intention and measure its semantic relevance …
is to correctly understand the human query intention and measure its semantic relevance …
Using linked data to annotate and search educational video resources for supporting distance learning
Multimedia educational resources play an important role in education, particularly for
distance learning environments. With the rapid growth of the multimedia web, large numbers …
distance learning environments. With the rapid growth of the multimedia web, large numbers …
Multiview cauchy estimator feature embedding for depth and inertial sensor-based human action recognition
The ever-growing popularity of Kinect and inertial sensors has prompted intensive research
efforts on human action recognition. Since human actions were extracted from Kinect and …
efforts on human action recognition. Since human actions were extracted from Kinect and …
Nonlinear structural hashing for scalable video search
In this paper, we propose a nonlinear structural hashing approach to learn compact binary
codes for scalable video search. Unlike most existing video hashing methods which …
codes for scalable video search. Unlike most existing video hashing methods which …