Attention mechanisms and deep learning for machine vision: A survey of the state of the art
With the advent of state of the art nature-inspired pure attention based models ie
transformers, and their success in natural language processing (NLP), their extension to …
transformers, and their success in natural language processing (NLP), their extension to …
[HTML][HTML] Detection of glaucoma using retinal fundus images: A comprehensive review
Content-based image analysis and computer vision techniques are used in various health-
care systems to detect the diseases. The abnormalities in a human eye are detected through …
care systems to detect the diseases. The abnormalities in a human eye are detected through …
Salient object detection in the deep learning era: An in-depth survey
As an essential problem in computer vision, salient object detection (SOD) has attracted an
increasing amount of research attention over the years. Recent advances in SOD are …
increasing amount of research attention over the years. Recent advances in SOD are …
Deep visual attention prediction
In this paper, we aim to predict human eye fixation with view-free scenes based on an end-to-
end deep learning architecture. Although convolutional neural networks (CNNs) have made …
end deep learning architecture. Although convolutional neural networks (CNNs) have made …
Predicting human eye fixations via an lstm-based saliency attentive model
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional
neural networks for predicting gaze fixations. In this paper, we go beyond standard …
neural networks for predicting gaze fixations. In this paper, we go beyond standard …
Deep learning based image recognition for crack and leakage defects of metro shield tunnel
The performance of traditional visual inspection by handcrafted features for crack and
leakage defects of metro shield tunnel is hardly satisfactory nowadays because it is low …
leakage defects of metro shield tunnel is hardly satisfactory nowadays because it is low …
Crowdnet: A deep convolutional network for dense crowd counting
Our work proposes a novel deep learning framework for estimating crowd density from static
images of highly dense crowds. We use a combination of deep and shallow, fully …
images of highly dense crowds. We use a combination of deep and shallow, fully …
Revisiting video saliency prediction in the deep learning era
Predicting where people look in static scenes, aka visual saliency, has received significant
research interest recently. However, relatively less effort has been spent in understanding …
research interest recently. However, relatively less effort has been spent in understanding …
Salgan: Visual saliency prediction with generative adversarial networks
We introduce SalGAN, a deep convolutional neural network for visual saliency prediction
trained with adversarial examples. The first stage of the network consists of a generator …
trained with adversarial examples. The first stage of the network consists of a generator …
Gaze prediction in dynamic 360 immersive videos
This paper explores gaze prediction in dynamic $360^ circ $ immersive videos, emph {ie},
based on the history scan path and VR contents, we predict where a viewer will look at an …
based on the history scan path and VR contents, we predict where a viewer will look at an …