[HTML][HTML] Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study

R Nirthika, S Manivannan, A Ramanan… - Neural Computing and …, 2022 - Springer
Convolutional neural networks (CNN) are widely used in computer vision and medical
image analysis as the state-of-the-art technique. In CNN, pooling layers are included mainly …

[HTML][HTML] Computer vision algorithms and hardware implementations: A survey

X Feng, Y Jiang, X Yang, M Du, X Li - Integration, 2019 - Elsevier
The field of computer vision is experiencing a great-leap-forward development today. This
paper aims at providing a comprehensive survey of the recent progress on computer vision …

Patch-netvlad: Multi-scale fusion of locally-global descriptors for place recognition

S Hausler, S Garg, M Xu, M Milford… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Visual Place Recognition is a challenging task for robotics and autonomous
systems, which must deal with the twin problems of appearance and viewpoint change in an …

Back to the feature: Learning robust camera localization from pixels to pose

PE Sarlin, A Unagar, M Larsson… - Proceedings of the …, 2021 - openaccess.thecvf.com
Camera pose estimation in known scenes is a 3D geometry task recently tackled by multiple
learning algorithms. Many regress precise geometric quantities, like poses or 3D points …

Mixvpr: Feature mixing for visual place recognition

A Ali-Bey, B Chaib-Draa… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous
driving as well as other computer vision tasks. It refers to the process of identifying a place …

Visual place recognition: A survey from deep learning perspective

X Zhang, L Wang, Y Su - Pattern Recognition, 2021 - Elsevier
Visual place recognition has attracted widespread research interest in multiple fields such
as computer vision and robotics. Recently, researchers have employed advanced deep …

T2vlad: global-local sequence alignment for text-video retrieval

X Wang, L Zhu, Y Yang - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Text-video retrieval is a challenging task that aims to search relevant video contents based
on natural language descriptions. The key to this problem is to measure text-video …

Group normalization

Y Wu, K He - Proceedings of the European conference on …, 2018 - openaccess.thecvf.com
Batch Normalization (BN) is a milestone technique in the development of deep learning,
enabling various networks to train. However, normalizing along the batch dimension …

Unifying deep local and global features for image search

B Cao, A Araujo, J Sim - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Image retrieval is the problem of searching an image database for items that are similar to a
query image. To address this task, two main types of image representations have been …

Context encoding for semantic segmentation

H Zhang, K Dana, J Shi, Z Zhang… - Proceedings of the …, 2018 - openaccess.thecvf.com
Recent work has made significant progress in improving spatial resolution for pixelwise
labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous …