Bridgetower: Building bridges between encoders in vision-language representation learning X Xu, C Wu, S Rosenman, V Lal, W Che, N Duan Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 10637 …, 2023 | 50 | 2023 |
Vl-interpret: An interactive visualization tool for interpreting vision-language transformers E Aflalo, M Du, SY Tseng, Y Liu, C Wu, N Duan, V Lal Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022 | 39 | 2022 |
Kd-vlp: Improving end-to-end vision-and-language pretraining with object knowledge distillation Y Liu, C Wu, S Tseng, V Lal, X He, N Duan arXiv preprint arXiv:2109.10504, 2021 | 25 | 2021 |
Ldm3d: Latent diffusion model for 3d GBM Stan, D Wofk, S Fox, A Redden, W Saxton, J Yu, E Aflalo, SY Tseng, ... arXiv preprint arXiv:2305.10853, 2023 | 23 | 2023 |
Neurocounterfactuals: Beyond minimal-edit counterfactuals for richer data augmentation P Howard, G Singer, V Lal, Y Choi, S Swayamdipta arXiv preprint arXiv:2210.12365, 2022 | 22 | 2022 |
InterpreT: An interactive visualization tool for interpreting transformers V Lal, A Ma, E Aflalo, P Howard, A Simoes, D Korat, O Pereg, G Singer, ... Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 18 | 2021 |
Brain encoding models based on multimodal transformers can transfer across language and vision J Tang, M Du, V Vo, V Lal, A Huth Advances in Neural Information Processing Systems 36, 2024 | 16 | 2024 |
Improving video retrieval using multilingual knowledge transfer A Madasu, E Aflalo, G Ben Melech Stan, SY Tseng, G Bertasius, V Lal European Conference on Information Retrieval, 669-684, 2023 | 12 | 2023 |
Opinion-based relational pivoting for cross-domain aspect term extraction A Klein, O Pereg, D Korat, V Lal, M Wasserblat, I Dagan Proceedings of the 12th workshop on computational approaches to subjectivity …, 2022 | 12 | 2022 |
Cross-domain aspect extraction using transformers augmented with knowledge graphs P Howard, A Ma, V Lal, AP Simoes, D Korat, O Pereg, M Wasserblat, ... Proceedings of the 31st acm international conference on information …, 2022 | 10 | 2022 |
Coco-counterfactuals: Automatically constructed counterfactual examples for image-text pairs T Le, V Lal, P Howard Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models GBM Stan, RY Rohekar, Y Gurwicz, ML Olson, A Bhiwandiwalla, E Aflalo, ... arXiv preprint arXiv:2404.03118, 2024 | 5 | 2024 |
Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples P Howard, A Madasu, T Le, GL Moreno, A Bhiwandiwalla, V Lal arXiv preprint arXiv:2312.00825, 2023 | 4 | 2023 |
Neurocomparatives: Neuro-symbolic distillation of comparative knowledge P Howard, J Wang, V Lal, G Singer, Y Choi, S Swayamdipta arXiv preprint arXiv:2305.04978, 2023 | 4 | 2023 |
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model M Hinck, ML Olson, D Cobbley, SY Tseng, V Lal arXiv preprint arXiv:2404.01331, 2024 | 3 | 2024 |
Mumur: multilingual multimodal universal retrieval A Madasu, E Aflalo, GBM Stan, S Rosenman, SY Tseng, G Bertasius, ... Information Retrieval Journal 26 (1), 5, 2023 | 2 | 2023 |
Neuroprompts: An adaptive framework to optimize prompts for text-to-image generation S Rosenman, V Lal, P Howard arXiv preprint arXiv:2311.12229, 2023 | 2 | 2023 |
Probing intersectional biases in vision-language models with counterfactual examples P Howard, A Madasu, T Le, GL Moreno, V Lal arXiv preprint arXiv:2310.02988, 2023 | 2 | 2023 |
ManagerTower: Aggregating the insights of uni-modal experts for vision-language representation learning X Xu, B Li, C Wu, SY Tseng, A Bhiwandiwalla, S Rosenman, V Lal, W Che, ... arXiv preprint arXiv:2306.00103, 2023 | 2 | 2023 |
Getting it Right: Improving Spatial Consistency in Text-to-Image Models A Chatterjee, GBM Stan, E Aflalo, S Paul, D Ghosh, T Gokhale, L Schmidt, ... arXiv preprint arXiv:2404.01197, 2024 | 1 | 2024 |