Refactor policy network code and remove LSTM as default by riccardosavorgnan · Pull Request #393 · Emerge-Lab/PufferDrive

riccardosavorgnan · 2026-04-09T17:00:55Z

Still WIP: refactoring of the neural net code for the policy plus remove the LSTM as default (MLP policy is now default)

riccardosavorgnan · 2026-04-09T17:02:55Z

+        return logits, value
+
+
+class MultiDiscreteDriveLSTM(Policy):


This is still WIP, it might contain incomplete method and serious bugs.

riccardosavorgnan · 2026-04-09T17:03:56Z

    assert_near(output_puffer, output_torch.numpy())


-def test_drive(batch_size=1, input_size=512, hidden_size=512):


will re-introduce before final review

semi working version, still WIP for LSTM network

c6772f5

riccardosavorgnan commented Apr 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor policy network code and remove LSTM as default#393

Refactor policy network code and remove LSTM as default#393
riccardosavorgnan wants to merge 1 commit into3.0from
ricky/refactor_policy_remote_lstm

riccardosavorgnan commented Apr 9, 2026 •

edited

Loading

Uh oh!

riccardosavorgnan Apr 9, 2026

Uh oh!

riccardosavorgnan Apr 9, 2026

Uh oh!

eugenevinitsky Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		assert_near(output_puffer, output_torch.numpy())


		def test_drive(batch_size=1, input_size=512, hidden_size=512):

Conversation

riccardosavorgnan commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

riccardosavorgnan Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

riccardosavorgnan Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

eugenevinitsky Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

riccardosavorgnan commented Apr 9, 2026 •

edited

Loading