Attention mechanisms and deep learning for machine vision: A survey of the state of the art

AM Hafiz, SA Parah, RUA Bhat - arXiv preprint arXiv:2106.07550, 2021 - arxiv.org
With the advent of state of the art nature-inspired pure attention based models ie
transformers, and their success in natural language processing (NLP), their extension to …

[HTML][HTML] Detection of glaucoma using retinal fundus images: A comprehensive review

A Shabbir, A Rasheed, H Shehraz… - Mathematical …, 2021 - aimspress.com
Content-based image analysis and computer vision techniques are used in various health-
care systems to detect the diseases. The abnormalities in a human eye are detected through …

Salient object detection in the deep learning era: An in-depth survey

W Wang, Q Lai, H Fu, J Shen, H Ling… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
As an essential problem in computer vision, salient object detection (SOD) has attracted an
increasing amount of research attention over the years. Recent advances in SOD are …

Deep visual attention prediction

W Wang, J Shen - IEEE Transactions on Image Processing, 2017 - ieeexplore.ieee.org
In this paper, we aim to predict human eye fixation with view-free scenes based on an end-to-
end deep learning architecture. Although convolutional neural networks (CNNs) have made …

Predicting human eye fixations via an lstm-based saliency attentive model

M Cornia, L Baraldi, G Serra… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional
neural networks for predicting gaze fixations. In this paper, we go beyond standard …

Deep learning based image recognition for crack and leakage defects of metro shield tunnel

H Huang, Q Li, D Zhang - Tunnelling and underground space technology, 2018 - Elsevier
The performance of traditional visual inspection by handcrafted features for crack and
leakage defects of metro shield tunnel is hardly satisfactory nowadays because it is low …

Crowdnet: A deep convolutional network for dense crowd counting

L Boominathan, SSS Kruthiventi, RV Babu - Proceedings of the 24th …, 2016 - dl.acm.org
Our work proposes a novel deep learning framework for estimating crowd density from static
images of highly dense crowds. We use a combination of deep and shallow, fully …

Revisiting video saliency prediction in the deep learning era

W Wang, J Shen, J Xie, MM Cheng… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Predicting where people look in static scenes, aka visual saliency, has received significant
research interest recently. However, relatively less effort has been spent in understanding …

Salgan: Visual saliency prediction with generative adversarial networks

J Pan, CC Ferrer, K McGuinness, NE O'Connor… - arXiv preprint arXiv …, 2017 - arxiv.org
We introduce SalGAN, a deep convolutional neural network for visual saliency prediction
trained with adversarial examples. The first stage of the network consists of a generator …

Gaze prediction in dynamic 360 immersive videos

Y Xu, Y Dong, J Wu, Z Sun, Z Shi… - proceedings of the …, 2018 - openaccess.thecvf.com
This paper explores gaze prediction in dynamic $360^ circ $ immersive videos, emph {ie},
based on the history scan path and VR contents, we predict where a viewer will look at an …