shared_train_eval_env #1732

Haichao-Zhang · 2025-03-13T16:15:44Z

Provides an option to train and eval on the same shared env, mimicking the case such as training in real with only one physical env.

le-horizon · 2025-03-13T18:15:43Z

alf/trainers/evaluator.py

-                seed=seed)
+            if config.shared_train_eval_env:
+                self._env = alf.get_env()
+                self._env.reset()


assert async_eval = False?

Good point. Added assertion.

Commit not pushed to the right remote?

ah, yes, that was what happened ... now pushed

Happened to me also. It's hard to remember, especially now that we don't change alf that often. We can probably remove the other remote.

le-horizon · 2025-03-14T15:43:04Z

alf/trainers/evaluator.py

+            if config.shared_train_eval_env:
+                assert not self._async, "should not use async_eval in shared_train_eval_env mode"
+                self._env = alf.get_env()
+                self._env.reset()


Curious, why do we call env.reset() here but not in the other branch. Maybe add a comment in the code?

emailweixu · 2025-03-24T23:41:33Z

alf/trainers/evaluator.py

-                for_evaluation=True,
-                num_parallel_environments=num_envs,
-                seed=seed)
+            if config.shared_train_eval_env:


Need to set the step_type in the replay buffer just before evaluation started to StepType.LAST.

Also need to set the next step type for training to FIRST

shared_train_eval_env

6e139a0

le-horizon reviewed Mar 13, 2025

View reviewed changes

Address comments

00ffd0c

le-horizon reviewed Mar 14, 2025

View reviewed changes

emailweixu reviewed Mar 24, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

shared_train_eval_env #1732

shared_train_eval_env #1732

Uh oh!

Haichao-Zhang commented Mar 13, 2025

Uh oh!

le-horizon Mar 13, 2025

Uh oh!

Haichao-Zhang Mar 13, 2025

Uh oh!

le-horizon Mar 14, 2025

Uh oh!

Haichao-Zhang Mar 14, 2025

Uh oh!

le-horizon Mar 14, 2025

Uh oh!

le-horizon Mar 14, 2025

Uh oh!

emailweixu Mar 24, 2025

Uh oh!

emailweixu Mar 24, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

shared_train_eval_env #1732

Are you sure you want to change the base?

shared_train_eval_env #1732

Uh oh!

Conversation

Haichao-Zhang commented Mar 13, 2025

Uh oh!

le-horizon Mar 13, 2025

Choose a reason for hiding this comment

Uh oh!

Haichao-Zhang Mar 13, 2025

Choose a reason for hiding this comment

Uh oh!

le-horizon Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

Haichao-Zhang Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

le-horizon Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

le-horizon Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

emailweixu Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

emailweixu Mar 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

emailweixu Mar 24, 2025 •

edited

Loading