Unsupervised discovery of mid-level discriminative patches

J Gu, Z Wang, J Kuen, L Ma, A Shahroudy, B Shuai… - Pattern recognition, 2018 - Elsevier

In the last few years, deep learning has led to very good performance on a variety of
problems, such as visual recognition, speech recognition and natural language processing …

被引用次数：6887 相关文章所有 7 个版本

[PDF] wiley.com Full View

Deep learning for retail product recognition: Challenges and techniques

Y Wei, S Tran, S Xu, B Kang… - Computational …, 2020 - Wiley Online Library

Taking time to identify expected products and waiting for the checkout in a retail store are
common scenes we all encounter in our daily lives. The realization of automatic product …

被引用次数：128 相关文章所有 16 个版本

[PDF] thecvf.com

With a little help from my friends: Nearest-neighbor contrastive learning of visual representations

D Dwibedi, Y Aytar, J Tompson… - Proceedings of the …, 2021 - openaccess.thecvf.com

Self-supervised learning algorithms based on instance discrimination train encoders to be
invariant to pre-defined transformations of the same instance. While most methods treat …

被引用次数：518 相关文章所有 5 个版本

[PDF] aaai.org

A bottom-up clustering approach to unsupervised person re-identification

Y Lin, X Dong, L Zheng, Y Yan, Y Yang - … of the AAAI conference on artificial …, 2019 - aaai.org

Most person re-identification (re-ID) approaches are based on supervised learning, which
requires intensive manual annotation for training data. However, it is not only …

被引用次数：603 相关文章所有 11 个版本

[PDF] thecvf.com

Scaling and benchmarking self-supervised visual representation learning

P Goyal, D Mahajan, A Gupta… - Proceedings of the ieee …, 2019 - openaccess.thecvf.com

Self-supervised learning aims to learn representations from the data itself without explicit
manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning-the …

被引用次数：489 相关文章所有 10 个版本

[PDF] thecvf.com

Interpretable convolutional neural networks

Q Zhang, YN Wu, SC Zhu - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com

This paper proposes a method to modify a traditional convolutional neural network (CNN)
into an interpretable CNN, in order to clarify knowledge representations in high conv-layers …

被引用次数：995 相关文章所有 10 个版本

[PDF] thecvf.com

InLoc: Indoor visual localization with dense matching and view synthesis

H Taira, M Okutomi, T Sattler… - Proceedings of the …, 2018 - openaccess.thecvf.com

We seek to predict the 6 degree-of-freedom (6DoF) pose of a query photograph with respect
to a large indoor 3D map. The contributions of this work are three-fold. First, we develop a …

被引用次数：545 相关文章所有 20 个版本

[PDF] cv-foundation.org

Training region-based object detectors with online hard example mining

A Shrivastava, A Gupta, R Girshick - Proceedings of the IEEE …, 2016 - cv-foundation.org

The field of object detection has made significant advances riding on the wave of region-
based ConvNets, but their training procedure still includes many heuristics and …

被引用次数：3018 相关文章所有 9 个版本

[PDF] thecvf.com

Unsupervised representation learning by sorting sequences

HY Lee, JB Huang, M Singh… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We present an unsupervised representation learning approach using videos without
semantic labels. We leverage the temporal coherence as a supervisory signal by formulating …

被引用次数：669 相关文章所有 10 个版本

[PDF] arxiv.org

Shuffle and learn: unsupervised learning using temporal order verification

I Misra, CL Zitnick, M Hebert - … , The Netherlands, October 11–14, 2016 …, 2016 - Springer

In this paper, we present an approach for learning a visual representation from the raw
spatiotemporal signals in videos. Our representation is learned without supervision from …

被引用次数：1060 相关文章所有 4 个版本