- Academic resource search

Generative adversarial networks (GANs) for image augmentation in agriculture: A systematic review

Y Lu, D Chen, E Olaniyi, Y Huang - Computers and Electronics in …, 2022 - Elsevier

In agricultural image analysis, optimal model performance is keenly pursued for better
fulfilling visual recognition tasks (e.g., image classification, segmentation, object detection …

Cited by 154

A systematic review and analysis of deep learning-based underwater object detection

S Xu, M Zhang, W Song, H Mei, Q He, A Liotta - Neurocomputing, 2023 - Elsevier

Underwater object detection is one of the most challenging research topics in computer
vision technology. The complex underwater environment makes underwater images suffer …

Cited by 71

MultiMAE: Multi-modal multi-task masked autoencoders

R Bachmann, D Mizrahi, A Atanov, A Zamir - European Conference on …, 2022 - Springer

We propose a pre-training strategy called Multi-modal Multi-task Masked Autoencoders
(MultiMAE). It differs from standard Masked Autoencoding in two key aspects: I) it can …

Cited by 201

Conflict-averse gradient descent for multi-task learning

B Liu, X Liu, X Jin, P Stone… - Advances in Neural …, 2021 - proceedings.neurips.cc

The goal of multi-task learning is to enable more efficient learning than single task learning
by sharing model structures for a diverse set of tasks. A standard multi-task learning …

Cited by 232

DeFRCN: Decoupled Faster R-CNN for few-shot object detection

L Qiao, Y Zhao, Z Li, X Qiu, J Wu… - Proceedings of the …, 2021 - openaccess.thecvf.com

Few-shot object detection, which aims at detecting novel objects rapidly from extremely few
annotated examples of previously unseen classes, has attracted significant research interest …

Cited by 244

Omnidata: A scalable pipeline for making multi-task mid-level vision datasets from 3D scans

A Eftekhar, A Sax, J Malik… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Computer vision now relies on data, but we know surprisingly little about what factors in the
data affect performance. We argue that this stems from the way data is collected. Designing …

Cited by 177

FairMOT: On the fairness of detection and re-identification in multiple object tracking

Y Zhang, C Wang, X Wang, W Zeng, W Liu - International journal of …, 2021 - Springer

Multi-object tracking (MOT) is an important problem in computer vision which has a wide
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …

Cited by 1275

PersFormer: 3D lane detection via perspective transformer and the OpenLane benchmark

L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng… - … on Computer Vision, 2022 - Springer

Methods for 3D lane detection have been recently proposed to address the issue of
inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.) …

Cited by 129

vMAP: Vectorised object mapping for neural field SLAM

X Kong, S Liu, M Taher… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We present vMAP, an object-level dense SLAM system using neural field representations.
Each object is represented by a small MLP, enabling efficient, watertight object modelling …

Cited by 55

Towards large-scale 3D representation learning with multi-dataset point prompt training

X Wu, Z Tian, X Wen, B Peng, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

The rapid advancement of deep learning models is often attributed to their ability to leverage
massive training data. In contrast, such privilege has not yet fully benefited 3D deep learning …

Cited by 17