Skip to content

Commit 2d6f8d2

Browse files
authored
feat: OpenCDA-MARL v1.0.0 - Multi-Agent RL framework
1 parent c956f2d commit 2d6f8d2

File tree

108 files changed

+47231
-4617
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

108 files changed

+47231
-4617
lines changed

.gitignore

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -152,5 +152,10 @@ ready*
152152

153153
!/recordings/*.json
154154
/checkpoints/*/*episode_*.*
155-
/checkpoints/*/latest_checkpoint.*
155+
checkpoints/**/best_checkpoint.*
156+
!/checkpoints/**/latest_checkpoint.*
156157
/archived_docs
158+
/.spec-workflow
159+
/runs
160+
/metrics_history
161+
/.pixi

README.md

Lines changed: 9 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -15,38 +15,28 @@ The key features of OpenCDA-MARL are:
1515
* <strong>Scalability</strong>: Distributed training infrastructure supporting large-scale multi-agent scenarios with hundreds of vehicles.
1616
* <strong>Mixed Autonomy</strong>: Support for mixed traffic with human-driven vehicles, rule-based AVs, and learning-based agents.
1717

18-
Users can refer to our [documentation](#) for detailed guides on MARL integration, training procedures, and API references. For the original OpenCDA documentation, visit [OpenCDA documentation](https://opencda-documentation.readthedocs.io/en/latest/).
18+
Users can refer to our [documentation](https://radar-lab.github.io/OpenCDA-MARL/) for detailed guides on MARL integration, training procedures, and API references. For the original OpenCDA documentation, visit [OpenCDA documentation](https://opencda-documentation.readthedocs.io/en/latest/).
1919

2020
## What's New in OpenCDA-MARL
2121

2222
### August 2025
2323

24-
* **MARL Framework Integration**: Full integration of Multi-Agent Reinforcement Learning capabilities with support for PPO, SAC, QMIX, and MADDPG algorithms.
25-
* **Distributed Training**: Scalable training infrastructure using Ray/RLlib for large-scale multi-agent scenarios.
26-
* **Mixed Autonomy Support**: Seamless integration of learning-based agents with rule-based vehicles and human-driven traffic.
27-
28-
### Key Updates from Original OpenCDA
29-
30-
* **Environment Changes**: Changed Conda environment to Pixi for easy installation.
31-
* **Enhanced Configuration System**: Clean YAML-based configuration with `default.yaml` template
32-
* **Docker Support**: Easy deployment and reproducibility
33-
* **Windows Compatibility**: Full support for Windows with Python 3.10.x and CUDA 12.8
34-
* **HD Map Manager**: Real-time rasterization maps for RL planning
35-
* **CARLA 0.9.15**: Latest CARLA version support with improved stability
24+
* **MARL Framework Integration**: Core Multi-Agent Reinforcement Learning framework with implemented algorithms including Q-learning, DQN, and TD3 for intersection management and cooperative driving tasks.
25+
* **Training Infrastructure**: Single-agent training capabilities with experience replay and checkpoint management, with distributed Ray/RLlib training planned for future releases.
26+
* **Mixed Autonomy Support**: Seamless integration of learning-based MARL agents with rule-based vehicles, vanilla behavior agents, and human-driven traffic.
3627

3728
## Major Components
3829

3930
![teaser](docs/images/OpenCDA_MARL_architecture.png)
4031

4132
OpenCDA-MARL extends the original four components with MARL-specific modules:
4233

43-
* <strong>MARL Training Framework</strong>: Distributed training infrastructure with multiple RL algorithms
44-
* <strong>Cooperative Driving System</strong>: Enhanced with learning-based decision making
45-
* <strong>Co-Simulation Tools</strong>: CARLA + SUMO integration with RL environment wrapper
34+
* <strong>MARL Training Framework</strong>: Core training infrastructure with Q-learning, DQN, and TD3 algorithms for single-agent and multi-agent scenarios
35+
* <strong>Cooperative Driving System</strong>: Enhanced with learning-based decision making for cooperative driving tasks
4636
* <strong>Data Manager and Repository</strong>: Training data collection and replay buffer management
4737
* <strong>Scenario Manager</strong>: MARL-specific training and evaluation scenarios
4838

49-
Check our [documentation](#) for detailed architecture and MARL integration.
39+
Check our [documentation](https://radar-lab.github.io/OpenCDA-MARL/marl/architecture/) for detailed architecture and MARL integration.
5040

5141
## Get Started
5242

@@ -62,8 +52,8 @@ Note: We continuously improve the performance of OpenCDA-MARL. Currently, it is
6252

6353
### Developer Guide
6454

65-
* [Class Design](https://radar-lab.github.io/OpenCDA-MARL/architecture/)
66-
* [Customize Your Algorithms](#)
55+
* [Class Design](https://radar-lab.github.io/OpenCDA-MARL/marl/architecture/)
56+
* [Customize Your Algorithms](https://radar-lab.github.io/OpenCDA-MARL/marl/algorithms/)
6757
* [API Reference](https://radar-lab.github.io/OpenCDA-MARL/api/opencda-marl/overview/) <br>
6858

6959
### Contributing
617 KB
Binary file not shown.
976 KB
Binary file not shown.
2.31 MB
Binary file not shown.
4.62 MB
Binary file not shown.
14.3 MB
Binary file not shown.
93.5 MB
Binary file not shown.
93.5 MB
Binary file not shown.
3.48 MB
Binary file not shown.

0 commit comments

Comments
 (0)