[HTML][HTML] Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study
Convolutional neural networks (CNN) are widely used in computer vision and medical
image analysis as the state-of-the-art technique. In CNN, pooling layers are included mainly …
image analysis as the state-of-the-art technique. In CNN, pooling layers are included mainly …
[HTML][HTML] Computer vision algorithms and hardware implementations: A survey
X Feng, Y Jiang, X Yang, M Du, X Li - Integration, 2019 - Elsevier
The field of computer vision is experiencing a great-leap-forward development today. This
paper aims at providing a comprehensive survey of the recent progress on computer vision …
paper aims at providing a comprehensive survey of the recent progress on computer vision …
Patch-netvlad: Multi-scale fusion of locally-global descriptors for place recognition
Abstract Visual Place Recognition is a challenging task for robotics and autonomous
systems, which must deal with the twin problems of appearance and viewpoint change in an …
systems, which must deal with the twin problems of appearance and viewpoint change in an …
Back to the feature: Learning robust camera localization from pixels to pose
Camera pose estimation in known scenes is a 3D geometry task recently tackled by multiple
learning algorithms. Many regress precise geometric quantities, like poses or 3D points …
learning algorithms. Many regress precise geometric quantities, like poses or 3D points …
Mixvpr: Feature mixing for visual place recognition
A Ali-Bey, B Chaib-Draa… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous
driving as well as other computer vision tasks. It refers to the process of identifying a place …
driving as well as other computer vision tasks. It refers to the process of identifying a place …
Visual place recognition: A survey from deep learning perspective
Visual place recognition has attracted widespread research interest in multiple fields such
as computer vision and robotics. Recently, researchers have employed advanced deep …
as computer vision and robotics. Recently, researchers have employed advanced deep …
T2vlad: global-local sequence alignment for text-video retrieval
Text-video retrieval is a challenging task that aims to search relevant video contents based
on natural language descriptions. The key to this problem is to measure text-video …
on natural language descriptions. The key to this problem is to measure text-video …
Group normalization
Batch Normalization (BN) is a milestone technique in the development of deep learning,
enabling various networks to train. However, normalizing along the batch dimension …
enabling various networks to train. However, normalizing along the batch dimension …
Unifying deep local and global features for image search
Image retrieval is the problem of searching an image database for items that are similar to a
query image. To address this task, two main types of image representations have been …
query image. To address this task, two main types of image representations have been …
Context encoding for semantic segmentation
Recent work has made significant progress in improving spatial resolution for pixelwise
labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous …
labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous …