基于轨迹优化的多模态端到端自动驾驶模型研究

蒋佳凯; 周亦威; 肖嗣伟

doi:10.16592/j.cnki.1004-7859.2026053

基于轨迹优化的多模态端到端自动驾驶模型研究

Research on Memory-Efficient Optimization-Based Multimodal End-to-End Autonomous Driving Model

摘要

摘要: 近年来端到端自动驾驶技术飞速发展，成为学术界和工业界的研究热点。端到端自动驾驶将感知、预测和规划整合为一个整体，以规划为目标，相较于传统的自动驾驶顺序执行的模块化设计，能有效地解决累计延误和模块协调的问题。然而，当前部分端到端自动驾驶模型的研究往往依赖于纯视觉，这使得模型检测能力十分受限，尤其在复杂的环境中，存在着许多安全隐患。因此，本文提出了一种基于轨迹优化的多模态端到端自动驾驶模型，该模型在端到端自动驾驶FusionAD框架的基础上，对规划模块的“特征交互与轨迹生成—安全优化建模—轨迹优化求解”三级结构提出了改进，使用了一种基于拟牛顿法改进的L-BFGS优化求解算法，替换了原有的基于牛顿法求解的IPOPT方法。本模型基于nuScenes数据集评估验证，并和各类主流算法比较，结果表明，L-BFGS的综合性能最好，碰撞率持平的基础上，平均位移误差减小了2%。此外，本文进一步将模型应用于nuScenes的黑夜场景中，激光雷达的引入使碰撞率降低了18.5%，并对比L-BFGS和各类主流算法，结果表明，模型在黑夜场景下轨迹碰撞率（CR_traj）下降了5.6%。此研究验证了激光雷达在自动驾驶的潜力，并提升了自动驾驶规划的安全性。

Abstract: In recent years, there has been stunning progress in the development of end-to-end autonomous driving technology, which constitutes a major research topic in both academia and industry. Vision-based approaches to fully autonomous driving have, from an end-to-end perspective, overcome most of the drawbacks presented by traditional modular designs. These focus mainly on sequential execution, where perception, prediction, and planning algorithms run as separate software modules. This paper discusses a multimodal end-to-end autonomous driving model based on trajectory optimization. It builds upon an already existing FusionAD framework for end-to-end autonomous driving and extends a three-level structure of planning modules for feature interaction and trajectory generation, safety optimization modeling, and trajectory optimization solver, further developing it by an enhanced L-BFGS optimization algorithm based on the quasi-Newton method to replace the original IPOPT methodology based on the Newton method. It is also benchmarked and validated on the nuScenes dataset for various elaborated models, compared with various mainstream algorithms. Results show that L-BFGS showed the best overall performance; it reduces the average displacement error by 2% at comparable collision rates. The model was then applied to the nighttime scenarios in nuScenes, where the integration of LiDAR resulted in an 18.5% reduction in collisions. The trajectory collision rate (CR_traj) in nighttime scenarios is reduced by 5.6%. The research, therefore, proves that incorporation supports LiDAR's significant role in autonomous driving and further ensures the safety of autonomous driving planning.

HTML全文

参考文献(0)

施引文献

资源附件(0)