A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing

📅 2025-09-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
End-to-end autonomous driving faces three key challenges: multimodal action generation, insufficient temporal stability, and poor cross-scenario generalization. To address these, we propose a knowledge-driven diffusion policy framework that innovatively integrates conditional diffusion models with a sparse-gated Mixture-of-Experts (MoE) mechanism. The diffusion model captures multimodal, temporally coherent action distributions, while the MoE dynamically routes expert modules based on driving context, enabling interpretable and scalable knowledge composition. Departing from conventional deterministic policies, our generative approach enhances behavioral diversity and long-horizon consistency. Evaluated on multi-scenario benchmarks including CARLA, our method achieves significant improvements: +12.3% task success rate, −38.7% collision rate, and superior control smoothness and environmental adaptability—demonstrating both effectiveness and robustness.

Technology Category

Application Category

📝 Abstract
End-to-end autonomous driving remains constrained by the need to generate multi-modal actions, maintain temporal stability, and generalize across diverse scenarios. Existing methods often collapse multi-modality, struggle with long-horizon consistency, or lack modular adaptability. This paper presents KDP, a knowledge-driven diffusion policy that integrates generative diffusion modeling with a sparse mixture-of-experts routing mechanism. The diffusion component generates temporally coherent and multi-modal action sequences, while the expert routing mechanism activates specialized and reusable experts according to context, enabling modular knowledge composition. Extensive experiments across representative driving scenarios demonstrate that KDP achieves consistently higher success rates, reduced collision risk, and smoother control compared to prevailing paradigms. Ablation studies highlight the effectiveness of sparse expert activation and the Transformer backbone, and activation analyses reveal structured specialization and cross-scenario reuse of experts. These results establish diffusion with expert routing as a scalable and interpretable paradigm for knowledge-driven end-to-end autonomous driving.
Problem

Research questions and friction points this paper is trying to address.

Generating multi-modal actions for autonomous driving
Maintaining temporal stability in driving policies
Generalizing across diverse driving scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Diffusion policy for multi-modal action generation
Expert routing mechanism for contextual adaptability
Sparse mixture-of-experts enabling knowledge composition
🔎 Similar Papers
No similar papers found.
Chengkai Xu
Chengkai Xu
Tongji University
Autonomous DrivingReinforcement LearningMulti-modal Model
J
Jiaqi Liu
Department of Computer Science, University of North Carolina at Chapel Hill, United States
Y
Yicheng Guo
College of Transportation, Tongji University, Shanghai 201804, China
Peng Hang
Peng Hang
Tongji University
autonomous vehiclecooperative drivingdecision making and controlintelligent transportation
J
Jian Sun
College of Transportation, Tongji University, Shanghai 201804, China