Visual Reasoning and Multi-Agent Approach in Multimodal Large Language Models (MLLMs): Solving TSP and mTSP Combinatorial Challenges

M Elhenawy, A Abutahoun, TI Alhadidi, A Jaber… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) harness comprehensive knowledge spanning
text, images, and audio to adeptly tackle complex problems, including zero-shot in-context …