Zhigang Sun1*, Yiru Wang1*†, Anqing Jiang1*, Shuo Wang1, Yu Gao1, Yuwen Heng1,
Shouyi Zhang1, An He1, Hao Jiang2, Jinhao Chai3, Zichong Gu3, Jijun Wang4,
Shichen Tang1, Lavdim Halilaj5, Juergen Luettin5, Hao Sun1
1Bosch Corporate Research RIX
2Shanghai Jiaotong University
3Shanghai University
4AIR, Tsinghua University
5Robert Bosch GmbH
(*) Equal contribution. (†) Corresponding author.
Autonomous driving requires accurate scene understanding, including road geometry, traffic agents, and their semantic relationships. In online HD map generation scenarios, raster-based representations are well-suited to vision models but lack geometric precision, while graph-based representations retain structural detail but become unstable without precise maps. To harness the complementary strengths of both, we propose DiffSemanticFusion—a fusion framework for multimodal trajectory prediction and planning. Our approach reasons over a semantic raster–fused BEV space, enhanced by a map diffusion module that improves both the stability and expressiveness of online HD map representations. We validate our framework on two downstream tasks: trajectory prediction and planning-oriented end-to-end autonomous driving. Experiments on real-world autonomous driving benchmarks, nuScenes and NAVSIM, demonstrate improved performance over several state-of-the-art (SOTA) methods. For the prediction task on nuScenes, we integrate DiffSemanticFusion with the online HD map informed QCNet, achieving a 5.1% performance improvement. For end-to-end autonomous driving in NAVSIM, DiffSemanticFusion achieves SOTA results, with a 15% performance gain in NavHard scenarios. In addition, extensive ablation and sensitivity studies show that our map diffusion module can be seamlessly integrated into other vector-based approaches to enhance performance.
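To give a rough intuition for the pipeline described above, here is a hypothetical toy sketch (not the paper's implementation): it perturbs an online-HD-map lane polyline with noise, refines it with a simple iterative denoiser standing in for the learned map diffusion module, and rasterizes the result into a BEV occupancy grid. All function names, grid sizes, and the Laplacian-smoothing denoiser are illustrative assumptions.

```python
import numpy as np

def denoise_polyline(noisy, steps=10):
    """Crude stand-in for a learned diffusion denoising step:
    iteratively pull interior points toward their neighbours' mean."""
    x = noisy.copy()
    for _ in range(steps):
        # RHS is evaluated fully before assignment, so this is a
        # synchronous Laplacian smoothing update of the interior points.
        x[1:-1] = 0.5 * x[1:-1] + 0.25 * (x[:-2] + x[2:])
    return x

def rasterize_to_bev(polyline, grid=32, extent=10.0):
    """Mark each polyline point in a binary BEV occupancy grid
    covering [-extent, extent] metres in both axes."""
    bev = np.zeros((grid, grid), dtype=np.float32)
    idx = np.clip(((polyline + extent) / (2 * extent) * grid).astype(int),
                  0, grid - 1)
    bev[idx[:, 1], idx[:, 0]] = 1.0  # rows = y, cols = x
    return bev

rng = np.random.default_rng(0)
# A straight lane centreline, then simulated online-map estimation noise.
lane = np.stack([np.linspace(-8.0, 8.0, 40), np.zeros(40)], axis=1)
noisy = lane + rng.normal(scale=0.5, size=lane.shape)
refined = denoise_polyline(noisy)
bev = rasterize_to_bev(refined)
```

In the actual method the denoiser is a trained diffusion model and the raster is fused with semantic BEV features; this sketch only shows where the two representations (vector polyline and raster grid) meet.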
[2025/08/06] ArXiv paper release. Code/Models are coming soon. Please stay tuned! ☕️
[2025/08/07] Open-sourced the mapless QCNet, trained on a GPU cluster, together with the stdout and stderr logs.
[2025/08/19] Provided a detailed training and evaluation walkthrough for the mapless QCNet, as shown in computing_jobs/qc_net_mapless_prediction_train.8564846.stdout
[2025/08/19] Initial update for DiffSemanticFusion; please switch to the diffsemanticfusion branch to check it out.
We will release the code step by step:
- Mapless QCNet
- Mapless QCNet with Online HD Map Diffusion
- DiffSemanticFusion Base
- DiffSemanticFusion + Sparse4D Sparse
- DiffSemanticFusion + Sparse Graph
- DiffSemanticFusion
Note: Due to policy, SemanticFormer cannot be open-sourced, so we only open-source the homogeneous graph fusion with BEV.
Note: The code needs to be cleaned up; I will open-source all of it within one month, as promised.
If you find DiffSemanticFusion useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entries.
@misc{sun2025diffsemanticfusionsemanticrasterbev,
title={DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion},
author={Zhigang Sun and Yiru Wang and Anqing Jiang and Shuo Wang and Yu Gao and Yuwen Heng and Shouyi Zhang and An He and Hao Jiang and Jinhao Chai and Zichong Gu and Wang Jijun and Shichen Tang and Lavdim Halilaj and Juergen Luettin and Hao Sun},
year={2025},
eprint={2508.01778},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2508.01778},
}

@article{sun2024semanticformer,
title={{SemanticFormer}: Holistic and semantic traffic scene representation for trajectory prediction using knowledge graphs},
author={Sun, Zhigang and Wang, Zixu and Halilaj, Lavdim and Luettin, Juergen},
journal={IEEE Robotics and Automation Letters},
year={2024},
publisher={IEEE}
}

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, with additional terms described by the nuScenes Terms of Use, in particular the "Licenses" section.

