If llm is the wizard, then code is the wand: A survey on how code empowers large language...

Y Ma, C Cui, X Cao, W Ye, P Liu, J Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Autonomous driving (AD) has made significant strides in recent years. However existing
frameworks struggle to interpret and execute spontaneous user instructions such as" …

被引用次数：12 相关文章所有 4 个版本

[PDF] arxiv.org

Receive, reason, and react: Drive as you say, with large language models in autonomous vehicles

C Cui, Y Ma, X Cao, W Ye… - IEEE Intelligent …, 2024 - ieeexplore.ieee.org

The fusion of human-centric design and artificial intelligence capabilities has opened up
new possibilities for next-generation autonomous vehicles that go beyond traditional …

被引用次数：26 相关文章所有 4 个版本

[PDF] arxiv.org

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

KH Huang, HP Chan, YR Fung, H Qiu, M Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org

Data visualization in the form of charts plays a pivotal role in data analysis, offering critical
insights and aiding in informed decision-making. Automatic chart understanding has …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

When llms meet cybersecurity: A systematic literature review

J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Infiagent-dabench: Evaluating agents on data analysis tasks

X Hu, Z Zhao, S Wei, Z Chai, G Wang, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we introduce" InfiAgent-DABench", the first benchmark specifically designed to
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

Knowagent: Knowledge-augmented planning for llm-based agents

Y Zhu, S Qiao, Y Ou, S Deng, N Zhang, S Lyu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) have demonstrated great potential in complex reasoning
tasks, yet they fall short when tackling more sophisticated challenges, especially when …

被引用次数：5 相关文章所有 2 个版本

[PDF] thecvf.com

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com

Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …

[PDF] arxiv.org

Prompting large language models for zero-shot clinical prediction with structured longitudinal electronic health record data

Y Zhu, Z Wang, J Gao, Y Tong, J An, W Liao… - arXiv preprint arXiv …, 2024 - arxiv.org

The inherent complexity of structured longitudinal Electronic Health Records (EHR) data
poses a significant challenge when integrated with Large Language Models (LLMs), which …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Benchmarking ChatGPT on Algorithmic Reasoning

S McLeish, A Schwarzschild, T Goldstein - arXiv preprint arXiv:2404.03441, 2024 - arxiv.org

We evaluate ChatGPT's ability to solve algorithm problems from the CLRS benchmark suite
that is designed for GNNs. The benchmark requires the use of a specified classical algorithm …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

F Zhang, J Li, YC Li, Z Zhang, Y Yu, D Ye - arXiv preprint arXiv …, 2024 - arxiv.org

Low sample efficiency is an enduring challenge of reinforcement learning (RL). With the
advent of versatile large language models (LLMs), recent works impart common-sense …