Skip to content

Add shard metadata to TensorDescriptor for mixed TP#158

Draft
ishandhanani wants to merge 2 commits intomainfrom
idhanani/mx-mixed-tp-v1
Draft

Add shard metadata to TensorDescriptor for mixed TP#158
ishandhanani wants to merge 2 commits intomainfrom
idhanani/mx-mixed-tp-v1

Conversation

@ishandhanani
Copy link
Contributor

Summary

  • Adds full_shape, shard_dim, effective_tp_size, shard_index to TensorDescriptor in p2p.proto
  • Enables target workers to compute byte-range overlaps for mixed tensor-parallelism (seed TP != target TP)
  • Updates all backends (memory, redis, k8s), Rust types, and Python stubs

Depends on

Test plan

  • cargo build --release + cargo test pass
  • E2E with SGLang mixed TP (Llama-3.3-70B-Instruct-FP8 TP2->TP2, TP2->TP4)

ishandhanani and others added 2 commits March 5, 2026 02:20
Extend WorkerMetadata with oneof backend_metadata supporting both
NIXL (bytes) and TransferEngine (session_id string). Update all
metadata backends (memory, redis, kubernetes, layered) and regenerate
Python protobuf stubs for protobuf 5.x compatibility.
Adds full_shape, shard_dim, effective_tp_size, and shard_index to
TensorDescriptor in p2p.proto. These fields enable target workers to
compute byte-range overlaps when source and target have different tensor
parallelism degrees (e.g., seed TP=2, target TP=4).

Updates all backends (memory, redis, k8s), types, and Python stubs.
@ishandhanani ishandhanani force-pushed the ishan/transfer-engine-backend branch from a9f1142 to cfbe428 Compare March 16, 2026 04:09
Base automatically changed from ishan/transfer-engine-backend to main March 17, 2026 20:25
@pull-request-size pull-request-size bot added size/L and removed size/M labels Mar 17, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant