Deep learning for instance retrieval: A survey
In recent years a vast amount of visual content has been generated and shared from many
fields, such as social media platforms, medical imaging, and robotics. This abundance of …
fields, such as social media platforms, medical imaging, and robotics. This abundance of …
SDBAD-Net: A spatial dual-branch attention dehazing network based on meta-former paradigm
Image dehazing is an emblematical low-level vision task that aims at restoring haze-free
images from haze images. Recently, some methods adopts deep learning techniques to …
images from haze images. Recently, some methods adopts deep learning techniques to …
Exploring optical-flow-guided motion and detection-based appearance for temporal sentence grounding
Temporal sentence grounding aims to localize a target segment in an untrimmed video
semantically according to a given sentence query. Most previous works focus on learning …
semantically according to a given sentence query. Most previous works focus on learning …
CNDesc: Cross normalization for local descriptors learning
For a long time, the local descriptors learning benefited from the use of L2 normalization,
which projects the descriptor space onto the hypersphere. However, there is no free lunch in …
which projects the descriptor space onto the hypersphere. However, there is no free lunch in …
Graph convolutional network discrete hashing for cross-modal retrieval
C Bai, C Zeng, Q Ma, J Zhang - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
With the rapid development of deep neural networks, cross-modal hashing has made great
progress. However, the information of different types of data is asymmetrical, that is to say, if …
progress. However, the information of different types of data is asymmetrical, that is to say, if …
OMCBIR: Offline mobile content-based image retrieval with lightweight CNN optimization
Abstract Convolutional Neural Networks (CNNs) have achieved great success in computer
vision applications. However, due to the high requirements for computation power and …
vision applications. However, due to the high requirements for computation power and …
Advanced dropout: A model-free methodology for bayesian dropout optimization
Due to lack of data, overfitting ubiquitously exists in real-world applications of deep neural
networks (DNNs). We propose advanced dropout, a model-free methodology, to mitigate …
networks (DNNs). We propose advanced dropout, a model-free methodology, to mitigate …
Label semantic knowledge distillation for unbiased scene graph generation
The Scene Graph Generation (SGG) task aims to detect all the objects and their pairwise
visual relationships in a given image. Although SGG has achieved remarkable progress …
visual relationships in a given image. Although SGG has achieved remarkable progress …
Lipformer: learning to lipread unseen speakers based on visual-landmark transformers
F Xue, Y Li, D Liu, Y Xie, L Wu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Lipreading refers to understanding and further translating the speech of a video speaker into
textual outputs. State-of-the-art lipreading methods excel in interpreting overlap speakers, ie …
textual outputs. State-of-the-art lipreading methods excel in interpreting overlap speakers, ie …
Forward propagation dropout in deep neural networks using Jensen–Shannon and random forest feature importance ranking
M Heidari, MH Moattar, H Ghaffari - Neural Networks, 2023 - Elsevier
Dropout is a mechanism to prevent deep neural networks from overfitting and improving
their generalization. Random dropout is the simplest method, where nodes are randomly …
their generalization. Random dropout is the simplest method, where nodes are randomly …