feat: switch from Haiku to Equinox by panahiparham · Pull Request #21 · andnp/rl-control-template

panahiparham · 2025-12-29T05:29:00Z

In this work in progress PR I am working on switching from Haiku to Equinox. The main changes are,

There is no longer an AgentState attribute, instead there are network and opt_state attributes (DQN also has a target_network)
Now we use equinox and its corresponding filtered jax operations to handle neural nets creation and training

What we are currently missing,

Anything except ReluNets (so mainly MinatarNet and AtariNet)
Speed (Code runs a little slow compared to Haiku
Type annotations for additions to DQN and EQRC

What I have verified so far,

Run DQN and EQRC and MC and checked it reached good performance in 1 seed
Checked that action values do not explode and do not turn Nan

Please let me know about the above decisions and if you think we should handle them in a different way. Please also point out areas of improvement for speed, code quality, and bugs.

Breaking Change

panahiparham · 2025-12-29T05:30:10Z

src/algorithms/nn/DQN.py


        if self.updates % self.target_refresh == 0:
-            self.state.target_params = self.state.params
+            self.target_network = copy(self.network)


Is this an appropriate way to handle target nets?

panahiparham · 2025-12-29T05:32:18Z

src/algorithms/nn/DQN.py

-import utils.chex as cxu
-
-@cxu.dataclass
-class AgentState:


What do you think about removing the AgentState? We can add it back in and store the model params alongside optimizer parameters. To perform forward passes we would use eqx.partition and eqx.combine a lot of times. Will it be slow?

panahiparham · 2025-12-29T05:32:57Z

src/algorithms/nn/EQRC.py

-        assert isinstance(updates, dict)
-
-        decay = tree_map(
+        updates.heads['h'] = jax.tree.map(


This step should be verified carefully.

panahiparham · 2025-12-29T05:34:02Z

src/representations/networks.py

+        assert len(inputs) == 1
+        key_1, key_2 = jax.random.split(key, 2)
+
+        return eqx.nn.Sequential([


Alternatively, each of these can be its own class in utils/eqx.py instead of being a Sequential.

panahiparham · 2025-12-29T05:35:05Z

src/utils/eqx.py

+import jax
+import equinox as eqx
+
+class MultiHead(eqx.Module):


What do you think about this? I wanted a multihead network to accomodate both QRC and DQN.

artemis79 · 2026-01-04T00:04:48Z

src/algorithms/nn/DQN.py

-            target_params=self.state.params,
-            optim=self.state.optim,
-        )
+        self.target_network = copy(self.network)


Maybe this should be deep copy instead of copy, so it copies the values of the object instead of the reference.

feat: switch from Haiku to Equinox

c4af8c0

Breaking Change

panahiparham commented Dec 29, 2025

View reviewed changes

artemis79 reviewed Jan 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: switch from Haiku to Equinox#21

feat: switch from Haiku to Equinox#21
panahiparham wants to merge 1 commit intoandnp:mainfrom
panahiparham:main

panahiparham commented Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Uh oh!

artemis79 Jan 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

panahiparham commented Dec 29, 2025

Uh oh!

panahiparham Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

panahiparham Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

panahiparham Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

panahiparham Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

panahiparham Dec 29, 2025

Choose a reason for hiding this comment

Uh oh!

artemis79 Jan 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants