Valeo4Cast: A Modular Approach to End-to-End Forecasting Y Xu, É Zablocki, A Boulch, G Puy, M Chen, F Bartoccioni, N Samet, ... arXiv preprint arXiv:2406.08113, 2024 | | 2024 |
A Concept-Based Explainability Framework for Large Multimodal Models J Parekh, P Khayatan, M Shukor, A Newson, M Cord arXiv preprint arXiv:2406.08074, 2024 | | 2024 |
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features P Couairon, M Shukor, JE Haugeard, M Cord, N Thome arXiv preprint arXiv:2406.02842, 2024 | | 2024 |
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs M Shukor, M Cord arXiv preprint arXiv:2405.16700, 2024 | | 2024 |
Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive? Y Xu, L Chambon, M Chen, A Alahi, M Cord, P Pérez International Conference on Robotics and Automation (ICRA), 2024 | 5 | 2024 |
What matters when building vision-language models? H Laurençon, L Tronchon, M Cord, V Sanh arXiv preprint arXiv:2405.02246, 2024 | 13 | 2024 |
Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI H Caselles-Dupré, C Mellerio, P Hérent, A Lopez-Persem, B Béranger, ... arXiv preprint arXiv:2404.05468, 2024 | | 2024 |
What Makes Multimodal In-Context Learning Work? F Bertini Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski arXiv preprint arXiv:2404.15736, 2024 | | 2024 |
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord arXiv preprint arXiv:2403.20105, 2024 | | 2024 |
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction L Feng, M Bahari, KMB Amor, É Zablocki, M Cord, A Alahi arXiv preprint arXiv:2403.15098, 2024 | | 2024 |
Improved baselines for data-efficient perceptual augmentation of LLMs T Vallaeys, M Shukor, M Cord, J Verbeek arXiv preprint arXiv:2403.13499, 2024 | 3 | 2024 |
GradPaint: Gradient-guided inpainting with diffusion models A Grechka, G Couairon, M Cord Computer Vision and Image Understanding 240, 103928, 2024 | 3 | 2024 |
What Makes Multimodal In-Context Learning Work? FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
PointBeV: A Sparse Approach for BeV Predictions L Chambon, É Zablocki, M Chen, F Bartoccioni, P Pérez, M Cord Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
Manipulating Trajectory Prediction with Backdoors K Massoud, K Grosse, M Chen, M Cord, P Pérez, A Alahi arXiv preprint arXiv:2312.13863, 2023 | | 2023 |
Reliability in Semantic Segmentation: Can We Use Synthetic Data? T Loiseau, TH Vu, M Chen, P Pérez, M Cord arXiv preprint arXiv:2312.09231, 2023 | | 2023 |
ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation C Rommel, V Letzelter, N Samet, R Marlet, M Cord, P Pérez, E Valle arXiv preprint arXiv:2312.06386, 2023 | | 2023 |
ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model EM Bakr, L Zhao, VT Hu, M Cord, P Pérez, M Elhoseiny arXiv preprint arXiv:2311.14542, 2023 | | 2023 |
Obelisc: An open web-scale filtered dataset of interleaved image-text documents H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... NeurIPS arXiv preprint arXiv:2306.16527, 2023 | 100 | 2023 |