[RFC] StoreInterface #77

casteryh · 2025-08-26T23:25:53Z

Summary:
This is a proposed interface that will be the backend of the replay buffer. I have included methods that would be useful from the point of view of making a replay buffer.

This also doubles as a proposal for the actual torchstore api.

Test Plan:
n/a

src/forge/interfaces.py

Summary: This is a proposed interface that will be the backend of the replay buffer. I have included methods that would be useful from the point of view of making a replay buffer. This also doubles as a proposal for the actual torchstore api. Test Plan: n/a

casteryh · 2025-09-09T22:12:36Z

@LucasLLC

casteryh · 2025-09-09T22:20:54Z

Also @kaiyuan-li

joecummings · 2025-09-09T22:29:16Z

src/forge/interfaces.py

+
+    # TODO(yuxuanh): add this to torchstore.
+    @abstractmethod
+    async def release(self, key: str) -> None:


Interesting idea: what's the inspiration here?

Say the trainer is at step 10 and will no longer need stuff from step 5. A reasonable thing to do would be simply mark all keys starting with replay_buffer.step_10 as released and move on, instead of waiting it to be actually deleted.

While from the torchstore side it's probably easier to implement this as instant deletion right now, it would be nice to have this semantics, for if and when we hit a scale where this matters.

However, on the other hand, since everything is implemented in Python. It's probably fast enough to just delete instantly since we don't deallocate memory when deleting. Indeed, currently all keys are held by a single process Controller actor in torchstore right now - so it makes less sense to reinvent GC ourself.

Things do get complicated if we need to shard the controller. And it's much easier to just not make any promises.

In this regard, we should probably remove the delete[_all] methods all together, as it would be a nightmare to do it correctly in a distributed setting.

There could be some perf gains here where we notify controller of delete and then let storage volumes garbage collect later.

DNXie · 2025-09-10T18:18:57Z

src/forge/interfaces.py

+
+    # TODO(yuxuanh): add this to torchstore.
+    @abstractmethod
+    async def release_all(self, prefix: str) -> None:


what's the difference between release and delete?

And delete?

I meant the difference between two functions release and delete

LucasLLC · 2025-09-23T22:35:07Z

I like the ideas here but in general I think as a practice I think we should only add what we need as it's needed and used. Great work!

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 26, 2025

casteryh commented Aug 26, 2025

View reviewed changes

src/forge/interfaces.py Outdated Show resolved Hide resolved

casteryh commented Aug 26, 2025

View reviewed changes

src/forge/interfaces.py Outdated Show resolved Hide resolved

casteryh changed the title ~~Add BufferView and RawBuffer interfaces~~ [RFC] BufferView and RawBuffer interfaces Aug 26, 2025

casteryh requested a review from joecummings August 26, 2025 23:29

casteryh changed the title ~~[RFC] BufferView and RawBuffer interfaces~~ Add BufferView and RawBuffer interfaces Aug 27, 2025

casteryh force-pushed the pr77 branch from 274959d to ddb7cc3 Compare August 27, 2025 00:03

casteryh changed the title ~~Add BufferView and RawBuffer interfaces~~ [RFC] Add BufferView and RawBuffer interfaces Aug 27, 2025

casteryh force-pushed the pr77 branch from ddb7cc3 to c30401f Compare August 27, 2025 00:04

This was referenced Aug 27, 2025

Implement SimpleRawBuffer, a RawBuffer backed by a python dict. #78

Closed

Add StatefulSampler interface and implement RandomStatefulSampler #79

Closed

casteryh requested a review from ebsmothers August 27, 2025 22:04

casteryh force-pushed the pr77 branch from c30401f to f4c9c09 Compare August 28, 2025 02:02

casteryh mentioned this pull request Aug 28, 2025

Allow passing in custom sampler in ReplayBuffer #86

Closed

casteryh force-pushed the pr77 branch from f4c9c09 to 6547207 Compare August 28, 2025 02:15

[RFC] StoreInterface

4a18203

Summary: This is a proposed interface that will be the backend of the replay buffer. I have included methods that would be useful from the point of view of making a replay buffer. This also doubles as a proposal for the actual torchstore api. Test Plan: n/a

casteryh force-pushed the pr77 branch from 6547207 to 4a18203 Compare September 9, 2025 22:10

casteryh changed the title ~~[RFC] Add BufferView and RawBuffer interfaces~~ [RFC] StoreInterface Sep 9, 2025

casteryh requested review from DNXie and LucasLLC September 9, 2025 22:10

casteryh mentioned this pull request Sep 9, 2025

More APIs meta-pytorch/torchstore#31

Open

joecummings reviewed Sep 9, 2025

View reviewed changes

DNXie reviewed Sep 10, 2025

View reviewed changes

casteryh closed this Oct 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC] StoreInterface #77

[RFC] StoreInterface #77

Uh oh!

casteryh commented Aug 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

casteryh commented Sep 9, 2025

Uh oh!

casteryh commented Sep 9, 2025

Uh oh!

joecummings Sep 9, 2025

Uh oh!

casteryh Sep 9, 2025 •

edited

Loading

Uh oh!

casteryh Sep 9, 2025 •

edited

Loading

Uh oh!

LucasLLC Sep 23, 2025

Uh oh!

DNXie Sep 10, 2025

Uh oh!

LucasLLC Sep 23, 2025

Uh oh!

DNXie Sep 23, 2025

Uh oh!

LucasLLC commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[RFC] StoreInterface #77

[RFC] StoreInterface #77

Uh oh!

Conversation

casteryh commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

casteryh commented Sep 9, 2025

Uh oh!

casteryh commented Sep 9, 2025

Uh oh!

joecummings Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

casteryh Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

casteryh Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LucasLLC Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

DNXie Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

LucasLLC Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

DNXie Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

LucasLLC commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

casteryh commented Aug 26, 2025 •

edited

Loading

casteryh Sep 9, 2025 •

edited

Loading

casteryh Sep 9, 2025 •

edited

Loading