
Enable batch training#134

Open
Oculux314 wants to merge 36 commits into main from nwil508

Conversation

Contributor

@Oculux314 Oculux314 commented Dec 5, 2025

You can now run a batch of training runs with different configurations by specifying --batch. This works with parallel executions; regular execution is unchanged if --batch is not specified.

You can set up the configurations in gymnasium_envrionments/scripts/batch_coordinator.py. E.g.

batch_config: dict[str, list[Any | tuple[Any, str]]] = {
    "alg_config.gamma": [0.9, 0.95],
    "env_config.task": ["run", "swingup"],
}

This will set up 4 runs:

  • gamma-0.9 task-run
  • gamma-0.95 task-run
  • gamma-0.9 task-swingup
  • gamma-0.95 task-swingup
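
The expansion above is the Cartesian product of the configured value lists. A minimal sketch of how such an expansion could work (the function name expand_runs is illustrative, not the actual batch_coordinator.py internals):

```python
from itertools import product
from typing import Any

# Hypothetical batch config mirroring the example above
batch_config: dict[str, list[Any]] = {
    "alg_config.gamma": [0.9, 0.95],
    "env_config.task": ["run", "swingup"],
}


def expand_runs(config: dict[str, list[Any]]) -> list[dict[str, Any]]:
    """Return one override dict per combination of configured values."""
    keys = list(config)
    return [dict(zip(keys, values)) for values in product(*config.values())]


runs = expand_runs(batch_config)
# 2 gamma values x 2 tasks -> 4 runs
```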

You can edit the _skip() function for more fine-grained control.

--b_start and --b_end allow you to run only a specified index range of the batch.

Edited the README with this information.

@Oculux314 Oculux314 requested a review from beardyFace December 13, 2025 21:12
Comment on lines +14 to +95
from cares_reinforcement_learning.util.configurations import (
    FunctionLayer,
    MLPConfig,
    TrainableLayer,
)

# MARK: ACTIVATION LAYERS

# GoLU
golu_a: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="GoLU"),
    ]
)
golu_c: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="GoLU"),
        TrainableLayer(layer_type="Linear", in_features=256, out_features=1),
    ]
)

# GELU
gelu_a: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="GELU"),
    ]
)
gelu_c: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="GELU"),
        TrainableLayer(layer_type="Linear", in_features=256, out_features=1),
    ]
)

# ReLU
relu_a: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="ReLU"),
    ]
)
relu_c: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="ReLU"),
        TrainableLayer(layer_type="Linear", in_features=256, out_features=1),
    ]
)

# Leaky ReLU
leaky_a: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="LeakyReLU"),
    ]
)
leaky_c: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="LeakyReLU"),
        TrainableLayer(layer_type="Linear", in_features=256, out_features=1),
    ]
)

# PReLU
prelu_a: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="PReLU"),
    ]
)
prelu_c: MLPConfig = MLPConfig(
    layers=[
        TrainableLayer(layer_type="Linear", out_features=256),
        FunctionLayer(layer_type="PReLU"),
        TrainableLayer(layer_type="Linear", in_features=256, out_features=1),
    ]
)
Member
Why are these activation functions here?

Contributor Author

Ah, so that's how you configure the batches: you need to write them in code (I considered JSON, but I don't think it brings any benefit). I've been using this for Hoda's activation functions, which is why these are here, but thank you for the suggestion - I'll strip them down to a more generic example for this PR.

@Oculux314 Oculux314 requested a review from beardyFace December 19, 2025 00:31
@Oculux314
Copy link
Contributor Author

Done! Edited the README as well.

@Oculux314 Oculux314 mentioned this pull request Dec 22, 2025
