Drivedreamer-2: Llm-enhanced world models for diverse driving video generation
World models have demonstrated superiority in autonomous driving, particularly in the
generation of multi-view driving videos. However, significant challenges still exist in
generating customized driving videos. In this paper, we propose DriveDreamer-2, which
builds upon the framework of DriveDreamer and incorporates a Large Language Model
(LLM) to generate user-defined driving videos. Specifically, an LLM interface is initially
incorporated to convert a user's query into agent trajectories. Subsequently, a HDMap …
generation of multi-view driving videos. However, significant challenges still exist in
generating customized driving videos. In this paper, we propose DriveDreamer-2, which
builds upon the framework of DriveDreamer and incorporates a Large Language Model
(LLM) to generate user-defined driving videos. Specifically, an LLM interface is initially
incorporated to convert a user's query into agent trajectories. Subsequently, a HDMap …
以上显示的是最相近的搜索结果。 查看全部搜索结果