Learning in high-dimensional multimedia data: the state of the art

L Gao, J Song, X Liu, J Shao, J Liu, J Shao - Multimedia Systems, 2017 - Springer
During the last decade, the deluge of multimedia data has impacted a wide range of
research areas, including multimedia retrieval, 3D tracking, database management, data …

A systematic review on content-based video retrieval

N Spolaôr, HD Lee, WSR Takaki, LA Ensina… - … Applications of Artificial …, 2020 - Elsevier
Content-based video retrieval and indexing have been associated with intelligent methods
in many applications such as education, medicine and agriculture. However, an extensive …

Deep multi-view enhancement hashing for image retrieval

C Yan, B Gong, Y Wei, Y Gao - IEEE Transactions on Pattern …, 2020 - ieeexplore.ieee.org
Hashing is an efficient method for nearest neighbor search in large-scale data space by
embedding high-dimensional feature descriptors into a similarity preserving Hamming …

Video captioning with attention-based LSTM and semantic consistency

L Gao, Z Guo, H Zhang, X Xu… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Recent progress in using long short-term memory (LSTM) for image captioning has
motivated the exploration of their applications for video captioning. By taking a video as a …

PROVID: Progressive and multimodal vehicle reidentification for large-scale urban surveillance

X Liu, W Liu, T Mei, H Ma - IEEE Transactions on Multimedia, 2017 - ieeexplore.ieee.org
Compared with person reidentification, which has attracted concentrated attention, vehicle
reidentification is an important yet frontier problem in video surveillance and has been …

A survey on learning to hash

J Wang, T Zhang, N Sebe… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Nearest neighbor search is a problem of finding the data points from the database such that
the distances from them to the query point are the smallest. Learning to hash is one of the …

Aggregation-based graph convolutional hashing for unsupervised cross-modal retrieval

PF Zhang, Y Li, Z Huang, XS Xu - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Cross-modal hashing has sparked much attention in large-scale information retrieval for its
storage and query efficiency. Despite the great success achieved by supervised …

Multi-modal hashing for efficient multimedia retrieval: A survey

L Zhu, C Zheng, W Guan, J Li, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the explosive growth of multimedia contents, multimedia retrieval is facing
unprecedented challenges on both storage cost and retrieval speed. Hashing technique can …

Hashing for similarity search: A survey

J Wang, HT Shen, J Song, J Ji - arXiv preprint arXiv:1408.2927, 2014 - arxiv.org
Similarity search (nearest neighbor search) is a problem of pursuing the data items whose
distances to a query item are the smallest from a large database. Various methods have …

Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length

X Wang, L Gao, P Wang, X Sun… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
3-D convolutional neural networks (3-D-convNets) have been very recently proposed for
action recognition in videos, and promising results are achieved. However, existing 3-D …