-
Notifications
You must be signed in to change notification settings - Fork 550
Description
Describe your usage question
Hi all, thanks for this marvelous work! Helps a lot!
When using STORE, I am a little confused.
Scenario
Say I have A->B. I manage a buffer, say torch.empty(8gb) -> buf_ptr on A
- I call
resigter_memoryon buf_ptr - I allocate 1GB on A within
buf_ptr, which I am sure about. Say it's about kv,kv_ptr = SliceFrom(buf_ptr, 1gb)- I will write sth into
kv_ptr
- I will write sth into
- I wanna call
put_from(key, ptr, size), orbatch_put_fromfor many.
On another machine, say B, I call get_into or batch_get_into from the key.
Questions
All I want is Store to help me MANAGE THE META!!
I do now need, and want it to demand another space(especially GPU) for its work.
But I found that I cannot accomplish with small global_segment_size
Why? I wish it can just like PUBLISH the meta.
put_from/get_into, like what they said, Store does not HOLD any raw data.
A lot of thanks!
🤔 I know, TE suits me better, if I transfer the ptr, len and sth else with my own message system, and just call the transfer op. But I want to use the K-V philosophy of Mooncake Store. I want global machines have access to the data via a KEY, not the hand-crafted META, and Hi storage, eviction, topology awareness, etc... I think it's much neater if the Store can support it at the framework level than I do. Now it seems like I have no control over its own management of CUDA memory of Store.