Open-vocabulary object detection via vision and language knowledge distillation X Gu, TY Lin, W Kuo, Y Cui arXiv preprint arXiv:2104.13921, 2021 | 754 | 2021 |
Pali: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... arXiv preprint arXiv:2209.06794, 2022 | 493 | 2022 |
Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning W Kuo, C Hӓne, P Mukherjee, J Malik, EL Yuh Proceedings of the National Academy of Sciences 116 (45), 22737-22745, 2019 | 262 | 2019 |
Deepbox: Learning objectness with convolutional networks W Kuo, B Hariharan, J Malik Proceedings of the IEEE international conference on computer vision, 2479-2487, 2015 | 220 | 2015 |
From lifestyle vlogs to everyday interactions DF Fouhey, W Kuo, AA Efros, J Malik Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 143 | 2018 |
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors W Kuo, A Angelova, J Malik, TY Lin Proceedings of the IEEE international conference on computer vision, 2019 | 142 | 2019 |
F-vlm: Open-vocabulary object detection upon frozen vision and language models W Kuo, Y Cui, X Gu, AJ Piergiovanni, A Angelova arXiv preprint arXiv:2209.15639, 2022 | 138 | 2022 |
Cost-sensitive active learning for intracranial hemorrhage detection W Kuo, C Häne, E Yuh, P Mukherjee, J Malik Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st …, 2018 | 116 | 2018 |
Learning open-world object proposals without learning to classify D Kim, TY Lin, A Angelova, IS Kweon, W Kuo IEEE Robotics and Automation Letters 7 (2), 5453-5460, 2022 | 107 | 2022 |
Zero-shot detection via vision and language knowledge distillation X Gu, TY Lin, W Kuo, Y Cui arXiv preprint arXiv:2104.13921 2 (3), 4, 2021 | 88 | 2021 |
Mask2CAD: 3D shape prediction by learning to segment and retrieve W Kuo, A Angelova, TY Lin, A Dai Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 80 | 2020 |
Quantitative analysis of intrinsic skin aging in dermal papillae by in vivo harmonic generation microscopy YH Liao, WC Kuo, SY Chou, CS Tsai, GL Lin, MR Tsai, YT Shih, GG Lee, ... Biomedical optics express 5 (9), 3266-3279, 2014 | 62 | 2014 |
Rethinking video vits: Sparse video tubes for joint image and video learning AJ Piergiovanni, W Kuo, A Angelova Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 57 | 2023 |
Real-time three-dimensional optical coherence tomography image-guided core-needle biopsy system WC Kuo, J Kim, ND Shemonski, EJ Chaney, DR Spillman Jr, SA Boppart Biomedical optics express 3 (6), 1149-1161, 2012 | 55 | 2012 |
Region-aware pretraining for open-vocabulary object detection with vision transformers D Kim, A Angelova, W Kuo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 45 | 2023 |
Patch2cad: Patchwise embedding learning for in-the-wild shape retrieval from a single image W Kuo, A Angelova, TY Lin, A Dai Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 30 | 2021 |
PatchFCN for Intracranial Hemorrhage Detection W Kuo, C Häne, E Yuh, P Mukherjee, J Malik arXiv:1806.03265, 2018 | 21 | 2018 |
Mammut: A simple architecture for joint learning for multimodal tasks W Kuo, AJ Piergiovanni, D Kim, X Luo, B Caine, W Li, A Ogale, L Zhou, ... arXiv preprint arXiv:2303.16839, 2023 | 19 | 2023 |
Video question answering with iterative video-text co-tokenization AJ Piergiovanni, K Morton, W Kuo, MS Ryoo, A Angelova European Conference on Computer Vision, 76-94, 2022 | 18 | 2022 |
Answer-me: Multi-task open-vocabulary visual question answering AJ Piergiovanni, W Li, W Kuo, M Saffar, F Bertsch, A Angelova arXiv preprint arXiv:2205.00949, 2022 | 16 | 2022 |