dl-m9/Multi-Agent-Embodied-Autonomous-Driving

Awesome Multi-Agent Autonomous Driving 🚗 🚙 🚓 🚕 🏎️

MAAD

This repository collects resources on Multi-Agent Autonomous Driving (MAAD). Unlike single-agent autonomous driving, which mainly focuses on enhancing the driving capabilities of a single vehicle, MAAD focuses on the collaboration and interaction between multiple agents, including vehicles and infrastructure.

If you want to understand the FULL-STACK technology of MULTI-AGENT AUTONOMOUS DRIVING, then this repo is definitely for you!

Come and Join Us! 👊🇨🇳🔥

Contribution

Feel free to open a pull request or contact us if you find any related papers that are not included here.

The process to submit a pull request is as follows:

  1. Fork the project into your own repository.
  2. Add the title, paper link, conference, and project/code link to papers.md using the following format:
     `[Journal/Conference]` Paper Title [Code/Project](Code/Project link)
  3. Submit the pull request to this branch.
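
As a rough illustration of the entry format in step 2, the following Python sketch assembles one line for papers.md. The helper name `format_entry`, its parameters, and the sample values are hypothetical and not part of this repository. The sketch also assumes that several links, if given, are simply appended in the same `[label](url)` form.

```python
def format_entry(venue: str, title: str, links: list[tuple[str, str]]) -> str:
    """Build one papers.md line: `[venue]` title [label](url) ...

    Hypothetical helper for illustration; the repository does not ship it.
    """
    if not venue.strip() or not title.strip():
        raise ValueError("venue and title are required")
    # The venue tag is wrapped in backticks, as in the format shown in step 2.
    parts = [f"`[{venue.strip()}]`", title.strip()]
    # Each link becomes a markdown link: [label](url).
    parts += [f"[{label}]({url})" for label, url in links]
    return " ".join(parts)


# Placeholder values only; not a real paper or link.
print(format_entry("CVPR'24", "Example Paper Title",
                   [("Code", "https://example.com/code")]))
```

Pasting the printed line into papers.md and checking it against neighbouring entries is still advisable, since the maintainers may prefer a different link order or label.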

Related Materials

Surveys

  1. [TPAMI'24] 3D Object Detection From Images for Autonomous Driving: A Survey [PDF]
  2. [TITS'24] A Survey on Recent Advancements in Autonomous Driving Using Deep Reinforcement Learning: Applications, Challenges, and Solutions [PDF]
  3. [ESWA] Autonomous driving system: A comprehensive survey [PDF]
  4. [TPAMI'24] Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe [PDF]
  5. [TPAMI] End-to-End Autonomous Driving: Challenges and Frontiers [PDF, Code]
  6. [arXiv] LLM4Drive: A Survey of Large Language Models for Autonomous Driving [PDF, Code]
  7. [arXiv] Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances [PDF, Code]
  8. [WACV Workshop] A Survey on Multimodal Large Language Models for Autonomous Driving [PDF]
  9. [arXiv] A Survey of Reasoning with Foundation Models [PDF, Code]
  10. [arXiv] Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities [PDF]
  11. [Annual Review of Control, Robotics, and Autonomous Systems] Planning and decision-making for autonomous vehicles [PDF]
  12. [Chinese Journal of Mechanical Engineering] Planning and Decision-making for Connected Autonomous Vehicles at Road Intersections: A Review [PDF]
  13. [COMST'22] A Survey of Collaborative Machine Learning Using 5G Vehicular Communications [PDF]
  14. [Proceedings of the IEEE] 6G for Vehicle-to-Everything (V2X) Communications: Enabling Technologies, Challenges, and Opportunities [PDF]
  15. [arXiv'25] Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future [PDF]
  16. [ICCV'25 Workshop] A Survey on Vision-Language-Action Models for Autonomous Driving [PDF]
  17. [IEEE Trans'25] Large (Vision) Language Models for Autonomous Vehicles: Current Trends and Future Directions [PDF]
  18. [IEEE Trans'25] Multi-agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects [PDF]
  19. [IEEE'25] Cooperative Perception for Automated Driving: A Survey of Algorithms, Applications, and Future Directions [PDF]
  20. [arXiv'25] Collaborative Perception Datasets for Autonomous Driving: A Review [PDF]
  21. [arXiv'25] Cooperative Safety Intelligence in V2X-Enabled Transportation: A Survey [PDF]
  22. [arXiv'25] Recent Advances in Multi-Agent Human Trajectory Prediction [PDF]
  23. [arXiv'25] Towards Vehicle-to-Everything Autonomous Driving: A Survey on Collaborative Perception [PDF]
  24. [arXiv'25] V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges [PDF]

GitHub Repos

  1. Awesome Autonomous Driving
  2. Autonomous Driving Datasets
  3. Awesome 3D Object Detection for Autonomous Driving
  4. CVPR 2024 Papers on Autonomous Driving
  5. End-to-End Autonomous Driving
  6. End-to-End Autonomous Driving (OpenDriveLab)
  7. Collaborative Perception
  8. Vision Language Models in Autonomous Driving and ITS

Paper Collection

Perception

  1. [ICCV'25 Workshop] Learning 3D Perception from Others' Predictions (R&B-POP) [PDF] [Code]
  2. [ICCV'25 Workshop] RG-Attn: Radian Glue Attention for Multi-modal Multi-agent Cooperative Perception [PDF]
  3. [ICCV'25 Workshop] MIC-BEV: Infrastructure-Based Multi-Camera Bird's-Eye-View Perception Transformer for 3D Object Detection [PDF]
  4. [ICCV'25 Workshop] SlimComm: Doppler-Guided Sparse Queries for Bandwidth-Efficient Cooperative 3-D Perception [PDF]
  5. [ICCV'25 Workshop] D3FNet: A Differential Attention Fusion Network for Fine-Grained Road Structure Extraction in Remote Perception Systems [PDF]
  6. [ICCV'25 Workshop] Understanding What Vision-Language Models See in Traffic: PixelSHAP for Object-Level Attribution in Autonomous Driving [PDF]
  7. [ICCV'25 Workshop] Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection [PDF]
  8. [ICCV'25 Workshop] Cross-camera Monocular 3D Detection for Autonomous Racing [PDF]
  9. [ICRA'25] CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query [PDF]
  10. [arXiv'25] V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models [PDF] [Code] [Webpage]
  11. [Electronics'25] Vision-Language Models for Autonomous Driving: CLIP-based Dynamic Scene Understanding [PDF]
  12. [CVPR'25] CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization [PDF] [Code]
  13. [CVPR'25] One is Plenty: A Polymorphic Feature Interpreter for Immutable Heterogeneous Collaborative Perception [PDF]
  14. [CVPR'25] SparseAlign: A Fully Sparse Framework for Cooperative Object Detection [PDF]
  15. [CVPR'25] Trajectory-aware Feature Alignment for Asynchronous Multi-Agent Perception [PDF]
  16. [ICCV'25] V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction [PDF]
  17. [ICCV'25] mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework [PDF]
  18. [AAAI'25] Privacy-Preserving V2X Collaborative Perception [PDF]
  19. [IEEE'25] SCOPE++: Robust Multi-Agent Collaborative Perception via Spatio-Temporal Awareness [PDF]
  20. [IEEE VTC'25] Robust Multi-Agent Collaborative Perception via Triple-Attention and Dynamic Gating [PDF]
  21. [arXiv'25] FocalComm: Hard Instance-Aware Multi-Agent Perception [PDF]
  22. [NeurIPS'25] STAMP: Scalable Task- And Model-Agnostic Collaborative Perception [PDF] [Code]
  23. [arXiv'25] CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model [PDF]
  24. [arXiv'25] CoMamba: Real-Time Cooperative Perception Unlocked with State Space Models [PDF]
  25. [arXiv'25] CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection [PDF]
  26. [arXiv'25] DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model [PDF]
  27. [arXiv'25] QuantV2X: A Fully Quantized Multi-Agent System for Cooperative Perception [PDF]
  28. [arXiv'25] ReVQom: Residual Vector Quantization For Communication-Efficient Multi-Agent Perception [PDF]
  29. [arXiv'25] CoBEVGlue: Self-Localized Collaborative Perception [PDF]
  30. [arXiv'25] RDComm: Rate-Distortion Optimized Communication for Collaborative Perception [PDF]
  31. [arXiv'25] JigsawComm: Joint Semantic Feature Encoding and Transmission for Communication-Efficient Cooperative Perception [PDF]
  32. [arXiv'25] FadeLead: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception [PDF]
  33. [arXiv'25] ParCon: Noise-Robust Collaborative Perception via Multi-Module Parallel Connection [PDF]
  34. [arXiv'25] SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles [PDF]
  35. [TITS'24] Toward Full-Scene Domain Generalization in Multi-Agent Collaborative Bird's Eye View Segmentation for Connected and Autonomous Driving [PDF]
  36. [CVPR'24] Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles [PDF] [Code]
  37. [ECCV'24] Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception [PDF]
  38. [ECCV'24] Rethinking the Role of Infrastructure in Collaborative Perception [PDF]
  39. [NeurIPS'24] Learning Cooperative Trajectory Representations for Motion Forecasting [PDF] [Code]
  40. [AAAI'24] What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception [PDF] [Code]
  41. [AAAI'24] DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection [PDF] [Code]
  42. [WACV'24] MACP: Efficient Model Adaptation for Cooperative Perception [PDF] [Code]
  43. [ICRA'24] Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter [PDF] [Code]
  44. [ICRA'24] Robust Collaborative Perception without External Localization and Clock Devices [PDF] [Code]
  45. [ICCV'23] Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception [PDF] [Code] [Webpage]
  46. [ICCV'23] HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer [PDF] [Code]
  47. [ICCV'23] CORE: Cooperative Reconstruction for Multi-Agent Perception [PDF] [Code]
  48. [ICCV'23] TransIFF: An Instance-Level Feature Fusion Framework for Vehicle-Infrastructure Cooperative 3D Detection with Transformers [PDF]
  49. [ICCV'23] UMC: A Unified Bandwidth-Efficient and Multi-Resolution Based Collaborative Perception Framework [PDF] [Code]
  50. [CVPR'23] Query-Centric Trajectory Prediction [PDF] [Code]
  51. [CVPR'23] Collaboration Helps Camera Overtake LiDAR in 3D Detection [PDF] [Code]
  52. [CVPR'23] BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection [PDF] [Code]
  53. [NeurIPS'23] Robust Asynchronous Collaborative 3D Detection via Bird's Eye View Flow [PDF] [Code]
  54. [NeurIPS'23] Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection [PDF] [Code]
  55. [NeurIPS'23] How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception [PDF] [Code]
  56. [TIV'23] HYDRO-3D: Hybrid object detection and tracking for cooperative perception using 3D LiDAR [PDF]
  57. [ICLR'23] CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving [PDF] [Code]
  58. [CoRL'23] BM2CP: Efficient Collaborative Perception with LiDAR-Camera Modalities [PDF] [Code]
  59. [ACMMM'23] DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception [PDF] [Code]
  60. [ACMMM'23] FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions [PDF] [Code]
  61. [ACMMM'23] What2comm: Towards Communication-Efficient Collaborative Perception via Feature Decoupling [PDF] [Code]
  62. [WACV'23] Adaptive Feature Fusion for Cooperative Perception Using LiDAR Point Clouds [PDF] [Code]
  63. [ICRA'23] Bridging the Domain Gap for Multi-Agent Perception [PDF] [Code]
  64. [ICRA'23] Deep Masked Graph Matching for Correspondence Identification in Collaborative Perception [PDF] [Code]
  65. [ICRA'23] Uncertainty Quantification of Collaborative Detection for Self-Driving [PDF] [Code]
  66. [ICRA'23] Model-Agnostic Multi-Agent Perception Framework [PDF] [Code]
  67. [CoRL'22] CoBEVT: Cooperative bird's eye view semantic segmentation with sparse transformers [PDF] [Code]
  68. [CVPR'22] COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles [PDF] [Code]
  69. [CVPR'22] Learning from All Vehicles [PDF] [Code]
  70. [ECCV'22] Latency-Aware Collaborative Perception [PDF] [Code]
  71. [CoRL'22] Multi-Robot Scene Completion: Towards Task-Agnostic Collaborative Perception [PDF] [Code]
  72. [ACMMM'22] Complementarity-Enhanced and Redundancy-Minimized Collaboration Network for Multi-agent Perception [PDF]
  73. [ICRA'22] Multi-Robot Collaborative Perception with Graph Neural Networks [PDF]

Decision-Making

  1. [ICCV'25 Workshop] Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning [PDF]
  2. [ICCV'25 Workshop] Contextual-Personalized Adaptive Cruise Control via Fine-Tuned Large Language Models [PDF]
  3. [ICCV'25 Workshop] Multi-modal Large Language Model for Training-free Vision-based Driver State [PDF]
  4. [ICCV'25 Workshop] V2X-based Logical Scenario Understanding with Vision-Language Models [PDF]
  5. [TMC'25] AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging [PDF]
  6. [arXiv] A Vehicle-Infrastructure Multi-layer Cooperative Decision-making Framework [PDF]
  7. [arXiv'24] CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic [PDF] [Code]
  8. [arXiv'24] AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning [PDF]
  9. [arXiv] Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning [PDF]
  10. [ECCV'24] MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making [PDF] [Code]
  11. [TITS'24] Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors [PDF]
  12. [TITS'24] A Multi-Agent Reinforcement Learning Approach for Safe and Efficient Behavior Planning of Connected Autonomous Vehicles [PDF]
  13. [TVT'24] Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework [PDF] [Code]
  14. [TIV'24] KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models [PDF] [Code]
  15. [ESWA'25] CCMA: A Framework for Cascading Cooperative Multi-agent in Autonomous Driving Merging using Large Language Models [PDF]
  16. [ICCV'25] CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving [PDF]
  17. [IEEE'25] LMMCoDrive: Cooperative Driving with Large Multimodal Models [PDF]
  18. [IEEE'25] DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences [PDF]
  19. [arXiv'23] LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving [PDF]
  20. [arXiv'25] Context-aware Decision Making in Autonomous Vehicles [PDF]
  21. [arXiv'25] Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving [PDF]
  22. [IEEE'25] Mixed Motivation Driven Social Multi-Agent Reinforcement Learning [PDF]
  23. [Frontiers'25] Multi-agent Reinforcement Learning Framework for Traffic Flow Management [PDF]
  24. [arXiv'25] Right-of-Way Based Multi-Agent Deep Reinforcement Learning [PDF]
  25. [arXiv'25] Cooperative Control of Self-Learning Traffic Signal and Connected Automated Vehicles [PDF]
  26. [arXiv'25] CoDrivingLLM: Towards Interactive and Learnable Cooperative Driving Automation [PDF] [Code]
  27. [arXiv'25] LangCoop: Collaborative Driving with Language [PDF]
  28. [arXiv'25] RSU-Assisted Cooperative Driving: A Multi-Agent Reinforcement Learning Approach [PDF]
  29. [arXiv'25] Debrief: Talking Vehicles - Cooperative Driving via Natural Language [PDF]
  30. [arXiv'25] NegoCollab: A Common Representation Negotiation Approach for Heterogeneous Collaborative Perception [PDF]
  31. [World Electric Vehicle Journal'24] A Review of Decision-Making and Planning for Autonomous Vehicles in Intersection Environments [PDF]
  32. [TVT'24] Decision-Making for Autonomous Vehicles in Random Task Scenarios at Unsignalized Intersection Using Deep Reinforcement Learning [PDF]
  33. [DDCLS'24] A Brief Survey of Deep Reinforcement Learning for Intersection Navigation of Autonomous Vehicles [PDF]
  34. [ICDE'24] Parameterized Decision-making with Multi-modality Perception for Autonomous Driving [PDF]
  35. [RAL'24] Language-driven policy distillation for cooperative driving in multi-agent reinforcement learning [PDF]
  36. [IoTML] Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning [PDF]
  37. [ITSC'23] Curriculum Proximal Policy Optimization with Stage-Decaying Clipping for Self-Driving at Unsignalized Intersections [PDF]
  38. [IV'23] Hybrid Decision Making for Autonomous Driving in Complex Urban Scenarios [PDF]
  39. [TIV'23] Robust Lane Change Decision Making for Autonomous Vehicles: An Observation Adversarial Reinforcement Learning Approach [PDF]
  40. [TITS'23] Robust Decision Making for Autonomous Vehicles at Highway On-Ramps: A Constrained Adversarial Reinforcement Learning Approach [PDF]
  41. [TVT'23] Exploiting Multi-Modal Fusion for Urban Autonomous Driving Using Latent Deep Reinforcement Learning [PDF]
  42. [TIV'23] A Multi-Vehicle Game-Theoretic Framework for Decision Making and Planning of Autonomous Vehicles in Mixed Traffic [PDF]
  43. [TVT'23] Towards Robust Decision-Making for Autonomous Driving on Highway [PDF] [Code]
  44. [IEEE Transactions on Transportation Electrification'23] Interaction-Aware Decision-Making for Autonomous Vehicles [PDF]
  45. [ICRA'23] Failure Detection for Motion Prediction of Autonomous Driving: An Uncertainty Perspective [PDF]
  46. [TITS'23] Deep Multi-agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic [PDF] [Code]
  47. [arXiv'23] Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios [PDF]
  48. [arXiv'23] Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications [PDF]
  49. [TITS'23] Coordinating CAV Swarms at Intersections With a Deep Learning Model [PDF]
  50. [National Conference on Sensors'23] A Comprehensive Survey on Multi-Agent Reinforcement Learning for Connected and Automated Vehicles [PDF]
  51. [arXiv] Bringing Diversity to Autonomous Vehicles: An Interpretable Multi-vehicle Decision-making and Planning Framework [PDF]
  52. [ISSN'22] Reinforcement Learning-Based Autonomous Driving at Intersections in CARLA Simulator [PDF]
  53. [Autonomous Intelligent Systems'22] Multi-agent Reinforcement Learning for Cooperative Lane Changing of Connected and Autonomous Vehicles in Mixed Traffic [PDF]
  54. [TVT'22] Highway Decision-Making and Motion Planning for Autonomous Driving via Soft Actor-Critic [PDF]
  55. [TITS'22] PNNUAD: Perception Neural Networks Uncertainty Aware Decision-Making for Autonomous Vehicle [PDF]
  56. [CoRL'22] Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System [PDF]
  57. [TITS'22] Social Coordination and Altruism in Autonomous Driving [PDF]
  58. [TITS'22] Multi-Agent DRL-Based Lane Change With Right-of-Way Collaboration Awareness [PDF]
  59. [arXiv'22] Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges [PDF] [Code]
  60. [Autonomous Intelligent Systems'22] Multi-agent reinforcement learning for autonomous vehicles: a survey [PDF]
  61. [IROS'21] Cooperative Autonomous Vehicles that Sympathize with Human Drivers [PDF] [Code]
  62. [CoRL'20] SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving [PDF] [Code]

Planning

  1. [ICCV'25 Workshop] V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction [PDF]
  2. [ICCV'25 Workshop] MAP: End-to-End Autonomous Driving with Map-Assisted Planning [PDF]
  3. [ICCV'25 Workshop] The Role of Radar in End-to-End Autonomous Driving [PDF]
  4. [ICCV'25 Workshop] Robust Scenario Mining Assisted by Multimodal Semantics [PDF]
  5. [ICCV'25 Workshop] Improving Event-Phase Captions in Multi-View Urban Traffic Videos via Prompt-Aware LoRA Tuning of Vision Language Models [PDF]
  6. [arXiv] CoDriveVLM: VLM-Enhanced Urban Cooperative Dispatching and Motion Planning for Future Autonomous Mobility on Demand Systems [PDF] [Code]
  7. [arXiv] Improved Consensus ADMM for Cooperative Motion Planning of Large-Scale Connected Autonomous Vehicles with Limited Communication [PDF] [Code]
  8. [arXiv] THOMAS: Trajectory Heatmap Output with Learned Multi-Agent Sampling [PDF]
  9. [arXiv] Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding [PDF] [Code]
  10. [TPAMI'24] MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying [PDF]
  11. [AAAI'24] EDA: Evolving and Distinct Anchors for Multimodal Motion Prediction [PDF] [Code]
  12. [arXiv'25] LeAD: The LLM Enhanced Planning System Converged with End-to-End Autonomous Driving [PDF]
  13. [IEEE'25] LLMDriver: Autonomous Driving Planning Based on Large Language Models [PDF]
  14. [arXiv'25] An LLM-Powered Cooperative Framework for Large-Scale Multi-Vehicle Navigation [PDF]
  15. [ICRA'24] Parallel Optimization with Hard Safety Constraints for Cooperative Planning of Connected Autonomous Vehicles [PDF]
  16. [RAL'24] SIMPL: A Simple and Efficient Multi-Agent Motion Prediction Baseline for Autonomous Driving [PDF]
  17. [RAL'24] CMP: Cooperative Motion Prediction With Multi-Agent Communication [PDF]
  18. [arXiv'25] UNCAP: Uncertainty-Guided Neurosymbolic Planning [PDF]
  19. [Electronics'25] Eco-Cooperative Planning and Control of Connected Autonomous Vehicles Considering Energy Consumption Characteristics [PDF]
  20. [IEEE'25] Multiagent Trajectory Prediction With Difficulty-Guided Feature Enhancement [PDF]
  21. [ICCV'25] Unified Multi-Agent Trajectory Modeling with Masked Trajectory Diffusion [PDF]
  22. [NeurIPS'25] TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction [PDF]
  23. [arXiv'25] Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous Driving [PDF]
  24. [arXiv'25] CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-Oriented Decoder in V2X Scenarios [PDF]
  25. [arXiv'25] V2X-RECT: An Efficient V2X Trajectory Prediction Framework via Redundant Interaction Filtering and Tracking Error Correction [PDF]
  26. [IEEE Internet of Things Journal'24] Coordination for Connected and Autonomous Vehicles at Unsignalized Intersections: An Iterative Learning-Based Collision-Free Motion Planning Method [PDF]
  27. [ICCV'23] BiFF: Bi-level Future Fusion with Polyline-based Coordinate for Interactive Trajectory Prediction [PDF]
  28. [CVPR'23] ProphNet: Efficient Agent-Centric Motion Forecasting with Anchor-Informed Proposals [PDF]
  29. [CVPR'23] FJMP: Factorized Joint Multi-Agent Motion Prediction over Learned Directed Acyclic Interaction Graphs [PDF] [Code]
  30. [CVPR'23] MotionDiffuser: Controllable Multi-Agent Motion Prediction Using Diffusion [PDF]
  31. [ICRA'23] Wayformer: Motion Forecasting via Simple & Efficient Attention Networks [PDF]
  32. [ICRA'23] GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting [PDF]
  33. [ICRA'23] GANet: Goal Area Network for Motion Forecasting [PDF] [Code]
  34. [TITS'23] Decentralized iLQR for Cooperative Trajectory Planning of Connected Autonomous Vehicles via Dual Consensus ADMM [PDF]
  35. [TIV'23] Fault-tolerant cooperative driving at signal-free intersections [PDF]
  36. [TIV'23] OpenCDA-ROS: Enabling Seamless Integration of Simulation and Real-World Cooperative Driving Automation [PDF]
  37. [TIV'23] Optimal Trajectory Planning for Connected and Automated Vehicles in Lane-Free Traffic With Vehicle Nudging [PDF]
  38. [TIV'23] Multi-Vehicle Conflict Management With Status and Intent Sharing Under Time Delays [PDF]
  39. [TIV'23] Optimizing Vehicle Re-Ordering Events in Coordinated Autonomous Intersection Crossings Under CAVs' Location Uncertainty [PDF]
  40. [RAL'23] MacFormer: Map-Agent Coupled Transformer for Real-Time and Robust Trajectory Prediction [PDF]
  41. [ICCV'23] Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders [PDF]
  42. [CoRL'23] iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning [PDF] [Code]
  43. [NeurIPS'22] Motion Transformer with Global Intention Localization and Local Movement Refinement [PDF] [Code]
  44. [CVPR'22] HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction [PDF] [Code]
  45. [ICRA'22] MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction [PDF]
  46. [ICRA'22] GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation [PDF]
  47. [TPAMI'22] HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding [PDF]
  48. [TITS'22] Cooperative Formation of Autonomous Vehicles in Mixed Traffic Flow: Beyond Platooning [PDF]
  49. [TVT'22] Multi-Lane Unsignalized Intersection Cooperation With Flexible Lane Direction Based on Multi-Vehicle Formation Control [PDF]
  50. [ICCV'21] DenseTNT: End-to-end Trajectory Prediction from Dense Goal Sets [PDF]
  51. [ITSC'21] OpenCDA: An Open Cooperative Driving Automation Framework Integrated with Co-Simulation [PDF] [Code]

Communication

  1. [CVPR'24] Communication-Efficient Collaborative Perception via Information Filling with Codebook [PDF] [Code]
  2. [CVPR'24] ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments [PDF] [Code]
  3. [CVPR'24] Multi-Agent Collaborative Perception via Motion-Aware Robust Communication Network [PDF] [Code]
  4. [ICRA'23] Communication-Critical Planning via Multi-Agent Trajectory Exchange [PDF]
  5. [ICRA'23] We Need to Talk: Identifying and Overcoming Communication-Critical Scenarios for Self-Driving [PDF]
  6. [IJCAI'22] Robust Collaborative Perception against Communication Interruption [PDF]
  7. [arXiv'25] InfoCom: Kilobyte-Scale Communication-Efficient Collaborative Perception with Information Bottleneck [PDF]
  8. [arXiv'25] Map4comm: A Map-Aware Collaborative Perception Framework with Efficient-Bandwidth Information Fusion [PDF]
  9. [ISPRS'25] MapCooper: A Communication-Efficient Collaborative Perception Framework via Map Alignment [PDF]
  10. [arXiv'25] TOCOM-V2I: Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception [PDF]
  11. [arXiv'25] PragComm: Pragmatic Communication in Multi-Agent Collaborative Perception [PDF]
  12. [arXiv'25] How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression [PDF]

End-to-End

  1. [ICCV'25 Workshop] Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition [PDF]
  2. [IV'24] ICOP: Image-based Cooperative Perception for End-to-End Autonomous Driving [PDF]
  3. [TIV'23] End-to-end Autonomous Driving with Semantic Depth Cloud Mapping and Multi-agent [PDF] [Code]
  4. [AAAI'22] CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving [PDF] [Code]
  5. [NeurIPS'21] Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization [PDF] [Code] [Webpage]
  6. [arXiv] End-to-End Autonomous Driving through V2X Cooperation [PDF] [Code]
  7. [arXiv] AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging [PDF]
  8. [arXiv] AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning [PDF]
  9. [NeurIPS'25] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning [Webpage]
  10. [arXiv'25] V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models [PDF]
  11. [arXiv'25] V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts [PDF]
  12. [arXiv'25] V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling [PDF]
  13. [arXiv'25] V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving [PDF]
  14. [arXiv'25] UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving [PDF]

Dataset and Simulator

Dataset

  1. [ICCV'25 Workshop] HetroD: A High-Fidelity Drone Dataset and Benchmark for Heterogeneous Traffic in Autonomous Driving [PDF]
  2. [ICCV'21] V2X-Sim: Multi-Agent Collaborative Perception Dataset and Benchmark for Autonomous Driving [PDF] [Code] [Webpage] V2X-Sim
  3. [ACCV'22] DOLPHINS: Dataset for Collaborative Perception Enabled Harmonious and Interconnected Self-Driving [PDF] [Code] [Webpage] DOLPHINS
  4. [ICRA'22] OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication [PDF] [Code] [Webpage] OPV2V
  5. [ECCV'22] V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer [PDF] [Code] [Webpage] V2X-ViT
  6. [CVPR'22] COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles [PDF] [Code] [Webpage] AutoCastSim
  7. [CVPR'22] DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection [PDF] [Code] [Webpage] DAIR-V2X
  8. [NeurIPS'22] Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps [PDF&review] [Code] [Webpage] CoPerception-UAV
  9. [NeurIPS'23] Robust Asynchronous Collaborative 3D Detection via Bird's Eye View Flow [PDF&review] IRV2V
  10. [CVPR'23] Collaboration Helps Camera Overtake LiDAR in 3D Detection [PDF] [Code] [Webpage] CoPerception-UAV+, OPV2V+
  11. [CVPR'23] V2V4Real: A Large-Scale Real-World Dataset for Vehicle-to-Vehicle Cooperative Perception [PDF] [Code] [Webpage] V2V4Real
  12. [CVPR'23] V2X-Seq: The Large-Scale Sequential Dataset for the Vehicle-Infrastructure Cooperative Perception and Forecasting [PDF] [Code] [Webpage] DAIR-V2X-Seq
  13. [ICRA'23] Robust Collaborative 3D Object Detection in Presence of Pose Errors [PDF] [Code] [Webpage] DAIR-V2X-C Complemented
  14. [ICCV'23] Optimizing the Placement of Roadside LiDARs for Autonomous Driving [PDF] Roadside-Opt
  15. [AAAI'24] DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving [PDF] [Code] [Webpage] DeepAccident
  16. [ICLR'24] An Extensible Framework for Open Heterogeneous Collaborative Perception [PDF&review] [Code] [Webpage] OPV2V-H
  17. [CVPR'24] HoloVIC: Large-Scale Dataset and Benchmark for Multi-Sensor Holographic Intersection and Vehicle-Infrastructure Cooperative [PDF] [Webpage] HoloVIC
  18. [CVPR'24] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset [PDF] [Code] [Webpage] Open MARS Dataset
  19. [CVPR'24] RCooper: A Real-World Large-Scale Dataset for Roadside Cooperative Perception [PDF] [Code] [Webpage] RCooper
  20. [CVPR'24] TUMTraf V2X Cooperative Perception Dataset [PDF] [Code] [Webpage] TUMTraf-V2X
  21. [CVPR'24] Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents [PDF] [Code] [Webpage] ChatSim
  22. [ECCV'24] H-V2X: A Large Scale Highway Dataset for BEV Perception [PDF] H-V2X
  23. [NeurIPS'24] Learning Cooperative Trajectory Representations for Motion Forecasting [PDF] [Code] [Webpage] DAIR-V2X-Traj
  24. [NeurIPS'24] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction [PDF] [Code] [Webpage] SMART
  25. [CVPR'25] Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking [PDF] Mono3DVLT-V2X
  26. [CVPR'25] RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions [PDF] RCP-Bench
  27. [CVPR'25] V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion [PDF] [Code] V2X-R
  28. [arXiv] Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions [PDF] [Code] [Webpage] Adver-City
  29. [arXiv] CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception [PDF] CP-GuardBench
  30. [arXiv] DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models [PDF] DriveGen
  31. [arXiv] Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and Benchmark [PDF] [Code] [Webpage] Griffin
  32. [arXiv] InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios [PDF] [Code] InScope
  33. [arXiv] Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration [PDF] [Code] [Webpage] Mixed Signals
  34. [arXiv] Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception [PDF] [Code] Multi-V2X
  35. [arXiv] RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling [PDF] OPV2V-N
  36. [arXiv] V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models [PDF] [Code] [Webpage] V2V-QA
  37. [arXiv] V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction [PDF] [Code] [Webpage] V2XPnP-Seq
  38. [arXiv] V2X-Radar: A Multi-Modal Dataset with 4D Radar for Cooperative Perception [PDF] [Webpage] V2X-Radar
  39. [arXiv] V2X-Real: a Large-Scale Dataset for Vehicle-to-Everything Cooperative Perception [PDF] [Webpage] V2X-Real
  40. [arXiv] V2X-ReaLO: An Open Online Framework and Dataset for Cooperative Perception in Reality [PDF] V2X-ReaLO
  41. [arXiv] WHALES: A Multi-Agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving [PDF] [Code] [Webpage] WHALES
  42. [arXiv] DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models [PDF] DriveGen-CS
  43. [NeurIPS'25] UrbanIng-V2X: A Large-Scale Multi-Vehicle, Multi-Infrastructure Dataset Across Multiple Intersections [PDF] [Code] UrbanIng-V2X
  44. [arXiv'25] CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios [PDF] [Code] CATS-V2V
  45. [arXiv'25] AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios [PDF] [Code] AGC-Drive
  46. [arXiv'25] TruckV2X: A Truck-Centered Perception Dataset [PDF]
  47. [arXiv'25] AirV2X: Unified Air-Ground Vehicle-to-Everything Collaboration [PDF]
  48. [arXiv'25] CoInfra: A Large-Scale Cooperative Infrastructure Perception System and Dataset in Adverse Weather [PDF]

Simulator

  1. [CoRL'17] CARLA: An Open Urban Driving Simulator [PDF] [Code] [Webpage] CARLA
  2. [NeurIPS'24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking [PDF] [Code] [Webpage] NAVSIM
  3. [arXiv] DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models [PDF] DriveGen

Security and Robustness

  1. [TMC'25] Collaborative Perception Against Data Fabrication Attacks in Vehicular Networks [PDF]
  2. [AAAI'25 Oral] CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception [PDF, Code]
  3. [IROS'24] Malicious Agent Detection for Robust Multi-Agent Collaborative Perception [PDF, Code]
  4. [ICRA'24] AdvGPS: Adversarial GPS for Multi-Agent Perception Attack [PDF] [Code]
  5. [JATS'24] RAMPART: Reinforcing Autonomous Multi-Agent Protection through Adversarial Resistance in Transportation [PDF]
  6. [TITS'24] A Survey of Multi-Vehicle Consensus in Uncertain Networks for Autonomous Driving [PDF]
  7. [USENIX Security'24] On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures [PDF]
  8. [AAAI'24] Robust Communicative Multi-Agent Reinforcement Learning with Active Defense [PDF]
  9. [VehicleSec'23] Cooperative Perception for Safe Control of Autonomous Vehicles under LiDAR Spoofing Attacks [PDF]
  10. [ICCV'23] Among Us: Adversarially Robust Collaborative Perception by Consensus [PDF, Code]
  11. [TDSC'23] MARNet: Backdoor Attacks Against Cooperative Multi-Agent Reinforcement Learning [PDF]
  12. [NeurIPS'23] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning [PDF]
  13. [TITS'22] A Survey on Cyber-Security of Connected and Autonomous Vehicles (CAVs) [PDF]
  14. [ICCV'21] Adversarial Attacks On Multi-Agent Communication [PDF]
  15. [arXiv'25] GCP: Guarded Collaborative Perception with Spatial-Temporal Aware Malicious Agent Detection [PDF]
  16. [arXiv'25] CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception [PDF]
  17. [arXiv'25] CP-FREEZER: Latency Attacks against Cooperative Perception [PDF]
  18. [arXiv'25] DSRC: Learning Density-Insensitive and Semantic-Aware Collaborative Representation against Corruptions [PDF]
  19. [arXiv'25] SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving [PDF]
  20. [arXiv'25] CoDynTrust: Robust Asynchronous Collaborative Perception via Dynamic Feature Trust Modulus [PDF]
  21. [arXiv] A Multi-Agent Security Testbed for the Analysis of Attacks and Defenses in Collaborative Sensor Fusion [PDF]
  22. [ACM Computing Surveys] Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning [PDF]
  23. [USENIX Security'25] From Threat to Trust: Exploiting Attention Mechanisms for Attacks and Defenses in Cooperative Perception (SOMBRA & LUCIA) [PDF]
  24. [ICCV'25] Pretend Benign: A Stealthy Adversarial Attack by Exploiting Vulnerabilities in Cooperative Perception [PDF]
  25. [IEEE'25] Robust Collaborative Perception: Combining Adversarial Training with Consensus Mechanism for Enhanced V2X Security [PDF]

