Lampilot: An open benchmark dataset for autonomous driving with language model programs
Autonomous driving (AD) has made significant strides in recent years. However existing
frameworks struggle to interpret and execute spontaneous user instructions such as" …
frameworks struggle to interpret and execute spontaneous user instructions such as" …
Receive, reason, and react: Drive as you say, with large language models in autonomous vehicles
The fusion of human-centric design and artificial intelligence capabilities has opened up
new possibilities for next-generation autonomous vehicles that go beyond traditional …
new possibilities for next-generation autonomous vehicles that go beyond traditional …
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Data visualization in the form of charts plays a pivotal role in data analysis, offering critical
insights and aiding in informed decision-making. Automatic chart understanding has …
insights and aiding in informed decision-making. Automatic chart understanding has …
When llms meet cybersecurity: A systematic literature review
J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …
Infiagent-dabench: Evaluating agents on data analysis tasks
In this paper, we introduce" InfiAgent-DABench", the first benchmark specifically designed to
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …
Knowagent: Knowledge-augmented planning for llm-based agents
Large Language Models (LLMs) have demonstrated great potential in complex reasoning
tasks, yet they fall short when tackling more sophisticated challenges, especially when …
tasks, yet they fall short when tackling more sophisticated challenges, especially when …
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding
Vision-language generative AI has demonstrated remarkable promise for empowering cross-
modal scene understanding of autonomous driving and high-definition (HD) map systems …
modal scene understanding of autonomous driving and high-definition (HD) map systems …
Prompting large language models for zero-shot clinical prediction with structured longitudinal electronic health record data
The inherent complexity of structured longitudinal Electronic Health Records (EHR) data
poses a significant challenge when integrated with Large Language Models (LLMs), which …
poses a significant challenge when integrated with Large Language Models (LLMs), which …
Benchmarking ChatGPT on Algorithmic Reasoning
We evaluate ChatGPT's ability to solve algorithm problems from the CLRS benchmark suite
that is designed for GNNs. The benchmark requires the use of a specified classical algorithm …
that is designed for GNNs. The benchmark requires the use of a specified classical algorithm …
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Low sample efficiency is an enduring challenge of reinforcement learning (RL). With the
advent of versatile large language models (LLMs), recent works impart common-sense …
advent of versatile large language models (LLMs), recent works impart common-sense …