Multi-task deep learning for medical image computing and analysis: A review

Y Zhao, X Wang, T Che, G Bao, S Li - Computers in Biology and Medicine, 2023 - Elsevier
The renaissance of deep learning has provided promising solutions to various tasks. While
conventional deep learning models are constructed for a single specific task, multi-task deep …

Advances and challenges in meta-learning: A technical review

A Vettoruzzo, MR Bouguelia… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Meta-learning empowers learning systems with the ability to acquire knowledge from
multiple tasks, enabling faster adaptation and generalization to new tasks. This review …

Bevformer: Learning bird's-eye-view representation from multi-camera images via spatiotemporal transformers

Z Li, W Wang, H Li, E Xie, C Sima, T Lu, Y Qiao… - European conference on …, 2022 - Springer
Abstract 3D visual perception tasks, including 3D detection and map segmentation based on
multi-camera images, are essential for autonomous driving systems. In this work, we present …

Petrv2: A unified framework for 3d perception from multi-camera images

Y Liu, J Yan, F Jia, S Li, A Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose PETRv2, a unified framework for 3D perception from multi-view
images. Based on PETR, PETRv2 explores the effectiveness of temporal modeling, which …

Sam-clip: Merging vision foundation models towards semantic and spatial understanding

H Wang, PKA Vasu, F Faghri… - Proceedings of the …, 2024 - openaccess.thecvf.com
The landscape of publicly available vision foundation models (VFMs) such as CLIP and
SAM is expanding rapidly. VFMs are endowed with distinct capabilities stemming from their …

Ties-merging: Resolving interference when merging models

P Yadav, D Tam, L Choshen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Transfer learning–ie, further fine-tuning a pre-trained model on a downstream task–can
confer significant advantages, including improved downstream performance, faster …

MBEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

E Xie, Z Yu, D Zhou, J Philion, A Anandkumar… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we propose M $^ 2$ BEV, a unified framework that jointly performs 3D object
detection and map segmentation in the Birds Eye View~(BEV) space with multi-camera …

Ext5: Towards extreme multi-task scaling for transfer learning

V Aribandi, Y Tay, T Schuster, J Rao, HS Zheng… - arXiv preprint arXiv …, 2021 - arxiv.org
Despite the recent success of multi-task learning and transfer learning for natural language
processing (NLP), few works have systematically studied the effect of scaling up the number …

Do current multi-task optimization methods in deep learning even help?

D Xin, B Ghorbani, J Gilmer… - Advances in neural …, 2022 - proceedings.neurips.cc
Recent research has proposed a series of specialized optimization algorithms for deep multi-
task models. It is often claimed that these multi-task optimization (MTO) methods yield …

Real-world image super-resolution as multi-task learning

W Zhang, X Li, G Shi, X Chen, Y Qiao… - Advances in …, 2024 - proceedings.neurips.cc
In this paper, we take a new look at real-world image super-resolution (real-SR) from a multi-
task learning perspective. We demonstrate that the conventional formulation of real-SR can …