Lampilot: An open benchmark dataset for autonomous driving with language model programs

Y Ma, C Cui, X Cao, W Ye, P Liu, J Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Autonomous driving (AD) has made significant strides in recent years. However existing
frameworks struggle to interpret and execute spontaneous user instructions such as" …

Receive, reason, and react: Drive as you say, with large language models in autonomous vehicles

C Cui, Y Ma, X Cao, W Ye… - IEEE Intelligent …, 2024 - ieeexplore.ieee.org
The fusion of human-centric design and artificial intelligence capabilities has opened up
new possibilities for next-generation autonomous vehicles that go beyond traditional …

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

KH Huang, HP Chan, YR Fung, H Qiu, M Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Data visualization in the form of charts plays a pivotal role in data analysis, offering critical
insights and aiding in informed decision-making. Automatic chart understanding has …

When llms meet cybersecurity: A systematic literature review

J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …

Infiagent-dabench: Evaluating agents on data analysis tasks

X Hu, Z Zhao, S Wei, Z Chai, G Wang, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce" InfiAgent-DABench", the first benchmark specifically designed to
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …

Knowagent: Knowledge-augmented planning for llm-based agents

Y Zhu, S Qiao, Y Ou, S Deng, N Zhang, S Lyu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated great potential in complex reasoning
tasks, yet they fall short when tackling more sophisticated challenges, especially when …

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding

X Cao, T Zhou, Y Ma, W Ye, C Cui… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …

Prompting large language models for zero-shot clinical prediction with structured longitudinal electronic health record data

Y Zhu, Z Wang, J Gao, Y Tong, J An, W Liao… - arXiv preprint arXiv …, 2024 - arxiv.org
The inherent complexity of structured longitudinal Electronic Health Records (EHR) data
poses a significant challenge when integrated with Large Language Models (LLMs), which …

Benchmarking ChatGPT on Algorithmic Reasoning

S McLeish, A Schwarzschild, T Goldstein - arXiv preprint arXiv:2404.03441, 2024 - arxiv.org
We evaluate ChatGPT's ability to solve algorithm problems from the CLRS benchmark suite
that is designed for GNNs. The benchmark requires the use of a specified classical algorithm …

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models

F Zhang, J Li, YC Li, Z Zhang, Y Yu, D Ye - arXiv preprint arXiv …, 2024 - arxiv.org
Low sample efficiency is an enduring challenge of reinforcement learning (RL). With the
advent of versatile large language models (LLMs), recent works impart common-sense …