Initial impl of go explore. #219

Open
ldoshi wants to merge 93 commits into master from goexplore

ldoshi (Owner) commented Jun 14, 2024

No description provided.

def __init__(self, rng):
    self._rng = rng

def update_times_since_lead_something_new(self, state, led_to_something_to_new: bool) -> None:

Collaborator commented

Is “lead” different from “led” or “led to”?
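
For readers following the naming question, here is a minimal sketch of the bookkeeping the signature above seems to describe. It assumes a per-state counter that resets when a visit from that state led to something new and increments otherwise. The class `NoveltyTracker` and its method names are hypothetical and are not taken from the PR.

```python
from collections import defaultdict


class NoveltyTracker:
    """Hypothetical helper: counts updates since a state last led to something new."""

    def __init__(self):
        # Missing states start at zero.
        self._times_since_new = defaultdict(int)

    def update(self, state, led_to_something_new: bool) -> None:
        if led_to_something_new:
            # Progress was made from this state, so restart its counter.
            self._times_since_new[state] = 0
        else:
            # Another update from this state without finding anything new.
            self._times_since_new[state] += 1

    def times_since_new(self, state) -> int:
        # Use .get so that lookups do not create entries as a side effect.
        return self._times_since_new.get(state, 0)
```

Under this reading, "led" (past tense) matches the boolean argument, since the flag reports what already happened on the last visit.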

ldoshi (Owner, Author) commented Jun 14, 2024 via email

pjantrania (Collaborator) commented Jun 14, 2024 via email

ldoshi (Owner, Author) commented Jun 14, 2024 via email

lint-action and others added 15 commits February 4, 2025 05:49
Support converting states to cells to allow various ideas for merging states together.
chunk start entries to send more work per process
In preparation for making the cache in rollout only hold diffs, remove some local vars and plumb hparams in deeper.

still runs:

python go_explore_phase_1.py --env-width=4 --go-explore-num-iterations=8 --cell-manager=downsample_cell_manager
==========
Entry Count: 5
 * wa-sampled: 0.1
 * wa-new: 0
 * wa-visit: 0.3
Shortest: [2, 2, 3, 3, 5]
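
The idea of converting states to cells so that similar states merge can be illustrated with a small sketch. It assumes a state is a 2D grid of integers and that downsampling means max-pooling over square blocks. The function `downsample_cell` and the grid representation are hypothetical and may differ from what `downsample_cell_manager` does in this PR.

```python
from typing import Sequence, Tuple

Grid = Sequence[Sequence[int]]
Cell = Tuple[Tuple[int, ...], ...]


def downsample_cell(state: Grid, factor: int) -> Cell:
    """Map a 2D grid state to a coarser, hashable cell by max-pooling blocks."""
    rows, cols = len(state), len(state[0])
    return tuple(
        tuple(
            # Take the maximum over one factor-by-factor block (clipped at edges).
            max(
                state[r][c]
                for r in range(r0, min(r0 + factor, rows))
                for c in range(c0, min(c0 + factor, cols))
            )
            for c0 in range(0, cols, factor)
        )
        for r0 in range(0, rows, factor)
    )
```

Because the result is a tuple of tuples, it can be used directly as a dictionary key, so two states that differ only within a block land in the same archive entry.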

josephmaa (Collaborator) commented

Use generate_heat_map.py to visualize a heatmap of a go-explore run!

josephmaa and others added 4 commits March 3, 2025 22:23
* Added updated heatmap code

* Fix issue with summing

* rename to occurrenceentry
rollouts use cache diffs to track changes.
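
One way to picture a rollout that tracks only its changes is to layer an empty diff over a shared cache. This is a rough sketch using the standard library's `collections.ChainMap`, where writes go to the first mapping only. The function `run_rollout` and the `(key, value)` update format are hypothetical, and the PR's actual cache may be structured quite differently.

```python
from collections import ChainMap
from typing import Dict, Hashable, Iterable, Tuple


def run_rollout(
    base_cache: Dict[Hashable, int],
    updates: Iterable[Tuple[Hashable, int]],
) -> Dict[Hashable, int]:
    """Apply updates against a view of the cache and return only the changes."""
    diff: Dict[Hashable, int] = {}
    # Reads fall through to base_cache; writes land in diff alone.
    view = ChainMap(diff, base_cache)
    for key, value in updates:
        if key not in view or view[key] != value:
            view[key] = value
    return diff
```

The caller can then merge the returned diff into the shared cache, for example with `base_cache.update(diff)`, which keeps the data sent back from each worker small.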

josephmaa (Collaborator) commented

Let's add steps_since_led_to_something_new to sibyl and plot the n states with the fewest steps over time.
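
As a starting point for that plot, the selection step could look like the sketch below. It assumes each snapshot is a dict mapping a state to its steps_since_led_to_something_new value. The function names and the snapshot format are hypothetical, and the sibyl integration is not shown.

```python
import heapq
from typing import Dict, Hashable, List, Tuple


def fewest_steps(counts: Dict[Hashable, int], n: int) -> List[Tuple[Hashable, int]]:
    """Return the n (state, steps) pairs with the smallest step counts."""
    return heapq.nsmallest(n, counts.items(), key=lambda item: item[1])


def fewest_steps_over_time(
    history: List[Dict[Hashable, int]], n: int
) -> List[List[Tuple[Hashable, int]]]:
    """Apply fewest_steps to every snapshot, oldest first."""
    return [fewest_steps(snapshot, n) for snapshot in history]
```

Each inner list is one time step, so the result can be handed to whatever plotting layer is used, one series per rank.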

josephmaa (Collaborator) commented

Resolve the circular import and get the data out of the cache_entry database. It should be easy to push that data to the frontend, where we want to see vertically arranged columns of data for the top n metrics, sorted, for cache entries.

ldoshi and others added 6 commits March 20, 2025 23:00
5 participants