Skip to content

hejm37/da-ac

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distributions-as-Actions Actor-Critic (DA-AC)

Reference implementation of Distributions-as-Actions Actor-Critic (DA-AC) for continuous and discrete control settings. See Distributions as Actions: A Unified Framework for Diverse Action Spaces, accepted at ICLR 2026.

Note: This repository includes only continuous and discrete control code. Hybrid-control code is available here.

Implemented Algorithms

This repository includes DA-AC variants and standard baselines.

Getting Started

Prerequisites

Installation

Install only the dependency groups you need:

pip install -r requirements/requirements-dm_control.txt
pip install -r requirements/requirements-mujoco.txt
pip install -r requirements/requirements-gym.txt
pip install -r requirements/requirements-minatar.txt

Run Experiments

Example (continuous control):

python da_ac/da_ac_continuous_mjc_dmc.py --env-id Hopper-v4 --seed 1 --total-timesteps 1000000

Most script-level hyperparameters are in each script’s Args dataclass (for example, da_ac_continuous_mjc_dmc.py and td3_continuous_mjc_dmc.py).

Experiment Tracking (Weights & Biases)

  1. Log in to W&B:

    wandb login
  2. Add --track when launching experiments:

    python da_ac/da_ac_continuous_mjc_dmc.py \
            --env-id Hopper-v4 \
            --track \
            --wandb-project-name your_da_ac_project \
            --wandb-entity your_wandb_username

TensorBoard logs are also saved locally under runs/.

Repository Structure

Citation

If you use this codebase, please cite:

@inproceedings{he2026distributions,
    title={Distributions as Actions: A Unified Framework for Diverse Action Spaces},
    author={Jiamin He and A. Rupam Mahmood and Martha White},
    booktitle={The Fourteenth International Conference on Learning Representations},
    year={2026},
    url={https://openreview.net/forum?id=4ol71wMPY8}
}

License

This project is licensed under the terms in LICENSE.

About

Code for Distributions as Actions: A Unified Framework for Diverse Action Spaces.

Resources

License

Stars

Watchers

Forks

Contributors

Languages