Generative adversarial networks (GANs) for image augmentation in agriculture: A systematic review
In agricultural image analysis, optimal model performance is keenly pursued for better
fulfilling visual recognition tasks (eg, image classification, segmentation, object detection …
fulfilling visual recognition tasks (eg, image classification, segmentation, object detection …
A systematic review and analysis of deep learning-based underwater object detection
Underwater object detection is one of the most challenging research topics in computer
vision technology. The complex underwater environment makes underwater images suffer …
vision technology. The complex underwater environment makes underwater images suffer …
Multimae: Multi-modal multi-task masked autoencoders
We propose a pre-training strategy called Multi-modal Multi-task Masked Autoencoders
(MultiMAE). It differs from standard Masked Autoencoding in two key aspects: I) it can …
(MultiMAE). It differs from standard Masked Autoencoding in two key aspects: I) it can …
Conflict-averse gradient descent for multi-task learning
The goal of multi-task learning is to enable more efficient learning than single task learning
by sharing model structures for a diverse set of tasks. A standard multi-task learning …
by sharing model structures for a diverse set of tasks. A standard multi-task learning …
Defrcn: Decoupled faster r-cnn for few-shot object detection
Few-shot object detection, which aims at detecting novel objects rapidly from extremely few
annotated examples of previously unseen classes, has attracted significant research interest …
annotated examples of previously unseen classes, has attracted significant research interest …
Omnidata: A scalable pipeline for making multi-task mid-level vision datasets from 3d scans
Computer vision now relies on data, but we know surprisingly little about what factors in the
data affect performance. We argue that this stems from the way data is collected. Designing …
data affect performance. We argue that this stems from the way data is collected. Designing …
Fairmot: On the fairness of detection and re-identification in multiple object tracking
Multi-object tracking (MOT) is an important problem in computer vision which has a wide
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …
range of applications. Formulating MOT as multi-task learning of object detection and re-ID …
Persformer: 3d lane detection via perspective transformer and the openlane benchmark
Methods for 3D lane detection have been recently proposed to address the issue of
inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.) …
inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.) …
vmap: Vectorised object mapping for neural field slam
We present vMAP, an object-level dense SLAM system using neural field representations.
Each object is represented by a small MLP, enabling efficient, watertight object modelling …
Each object is represented by a small MLP, enabling efficient, watertight object modelling …
Towards large-scale 3d representation learning with multi-dataset point prompt training
The rapid advancement of deep learning models is often attributed to their ability to leverage
massive training data. In contrast such privilege has not yet fully benefited 3D deep learning …
massive training data. In contrast such privilege has not yet fully benefited 3D deep learning …