Surveying the mllm landscape: A meta-review of current surveys
The rise of Multimodal Large Language Models (MLLMs) has become a transformative force
in the field of artificial intelligence, enabling machines to process and generate content …
in the field of artificial intelligence, enabling machines to process and generate content …
Towards general computer control: A multimodal agent for red dead redemption ii as a case study
Despite the success in specific tasks and scenarios, existing foundation agents, empowered
by large models (LMs) and advanced tools, still cannot generalize to different scenarios …
by large models (LMs) and advanced tools, still cannot generalize to different scenarios …
Multi-modal and multi-agent systems meet rationality: A survey
Rationality is characterized by logical thinking and decision-making that align with evidence
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
and logical rules. This quality is essential for effective problem-solving, as it ensures that …
Ing-vp: Mllms cannot play easy vision-based games yet
As multimodal large language models (MLLMs) continue to demonstrate increasingly
competitive performance across a broad spectrum of tasks, more intricate and …
competitive performance across a broad spectrum of tasks, more intricate and …
Physgame: Uncovering physical commonsense violations in gameplay videos
Recent advancements in video-based large language models (Video LLMs) have witnessed
the emergence of diverse capabilities to reason and interpret dynamic visual content …
the emergence of diverse capabilities to reason and interpret dynamic visual content …
LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models
This paper presents a comprehensive survey of the current status and opportunities for
Large Language Models (LLMs) in strategic reasoning, a sophisticated form of reasoning …
Large Language Models (LLMs) in strategic reasoning, a sophisticated form of reasoning …
On large language models in national security applications
WN Caballero, PR Jenkins - arXiv preprint arXiv:2407.03453, 2024 - arxiv.org
The overwhelming success of GPT-4 in early 2023 highlighted the transformative potential of
large language models (LLMs) across various sectors, including national security. This …
large language models (LLMs) across various sectors, including national security. This …
Strago: Harnessing strategic guidance for prompt optimization
Prompt engineering is pivotal for harnessing the capabilities of large language models
(LLMs) across diverse applications. While existing prompt optimization methods improve …
(LLMs) across diverse applications. While existing prompt optimization methods improve …
Large Model Agents: State-of-the-Art, Cooperation Paradigms, Security and Privacy, and Future Trends
Large Model (LM) agents, powered by large foundation models such as GPT-4 and DALL-E
2, represent a significant step towards achieving Artificial General Intelligence (AGI). LM …
2, represent a significant step towards achieving Artificial General Intelligence (AGI). LM …
Odyssey: Empowering Minecraft Agents with Open-World Skills
S Liu, Y Li, K Zhang, Z Cui, W Fang, Y Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent studies have delved into constructing generalist agents for open-world environments
like Minecraft. Despite the encouraging results, existing efforts mainly focus on solving basic …
like Minecraft. Despite the encouraging results, existing efforts mainly focus on solving basic …