End-to-end autonomous driving: Challenges and frontiers
The autonomous driving community has witnessed a rapid growth in approaches that
embrace an end-to-end algorithm framework, utilizing raw sensor input to generate vehicle …
embrace an end-to-end algorithm framework, utilizing raw sensor input to generate vehicle …
Grid-centric traffic scenario perception for autonomous driving: A comprehensive review
Grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, grid-centric perception is less prevalent than object-centric perception as …
Nonetheless, grid-centric perception is less prevalent than object-centric perception as …
Is sora a world simulator? a comprehensive survey on general world models and beyond
General world models represent a crucial pathway toward achieving Artificial General
Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual …
Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual …
Llm4drive: A survey of large language models for autonomous driving
Autonomous driving technology, a catalyst for revolutionizing transportation and urban
mobility, has the tend to transition from rule-based systems to data-driven strategies …
mobility, has the tend to transition from rule-based systems to data-driven strategies …
Sledge: Synthesizing driving environments with generative models and rule-based traffic
SLEDGE is the first generative simulator for vehicle motion planning trained on real-world
driving logs. Its core component is a learned model that is able to generate agent bounding …
driving logs. Its core component is a learned model that is able to generate agent bounding …
Bevworld: A multimodal world model for autonomous driving via unified bev latent space
World models are receiving increasing attention in autonomous driving for their ability to
predict potential future scenarios. In this paper, we present BEVWorld, a novel approach that …
predict potential future scenarios. In this paper, we present BEVWorld, a novel approach that …
Closed-loop visuomotor control with generative expectation for robotic manipulation
Despite significant progress in robotics and embodied AI in recent years, deploying robots
for long-horizon tasks remains a great challenge. Majority of prior arts adhere to an open …
for long-horizon tasks remains a great challenge. Majority of prior arts adhere to an open …
Covla: Comprehensive vision-language-action dataset for autonomous driving
Autonomous driving, particularly navigating complex and unanticipated scenarios, demands
sophisticated reasoning and planning capabilities. While Multi-modal Large Language …
sophisticated reasoning and planning capabilities. While Multi-modal Large Language …
CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions
Accurately and promptly predicting accidents among surrounding traffic agents from camera
footage is crucial for the safety of autonomous vehicles (AVs). This task presents substantial …
footage is crucial for the safety of autonomous vehicles (AVs). This task presents substantial …
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
World models can foresee the outcomes of different actions, which is of paramount
importance for autonomous driving. Nevertheless, existing driving world models still have …
importance for autonomous driving. Nevertheless, existing driving world models still have …