Share Parameter/Buffer Storage When Cloning Split GraphModules to Eliminate Redundant Tensor Copies #2509

KAVYANSHTYAGI · 2025-09-09T16:44:29Z

Description

Summary

This PR replaces a full copy.deepcopy of the split GraphModule with a lightweight clone that shares Parameter and buffer tensors with the original module. This avoids duplicating model weights during graph splitting and significantly reduces peak memory.

Motivation

copy.deepcopy(split_gm) duplicates every Parameter and registered buffer. For large models this can momentarily double memory usage and trigger OOMs. The splitter only needs a structural snapshot of the module; it does not mutate weights. Sharing tensor storage is therefore safe and much more memory-efficient.

What’s changed

Added _copy_without_tensors(module: nn.Module) -> nn.Module
Uses copy.deepcopy with a pre-populated memo so that all Parameters and registered buffers are reused rather than copied.

Replaced copy.deepcopy(split_gm) with _copy_without_tensors(split_gm) when capturing original_split_gm.

Minor: import itertools (for chain) and improved inline comments.

Implementation notes

The helper clones Python/FX structure while mapping each tensor id in parameters() and buffers() back to the original object:

memo = {id(t): t for t in itertools.chain(module.parameters(), module.buffers())}
clone = copy.deepcopy(module, memo)

Behavior of the splitter is unchanged; only memory characteristics improve.

for more information, see https://pre-commit.ci

Avoid tensor deepcopy in splitter

92911e2

KAVYANSHTYAGI requested review from mruberry, lantiga and t-vi as code owners September 9, 2025 16:44

[pre-commit.ci] auto fixes from pre-commit.com hooks

57c2701

for more information, see https://pre-commit.ci

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Share Parameter/Buffer Storage When Cloning Split GraphModules to Eliminate Redundant Tensor Copies #2509

Share Parameter/Buffer Storage When Cloning Split GraphModules to Eliminate Redundant Tensor Copies #2509

Uh oh!

KAVYANSHTYAGI commented Sep 9, 2025

Uh oh!

Uh oh!

Share Parameter/Buffer Storage When Cloning Split GraphModules to Eliminate Redundant Tensor Copies #2509

Are you sure you want to change the base?

Share Parameter/Buffer Storage When Cloning Split GraphModules to Eliminate Redundant Tensor Copies #2509

Uh oh!

Conversation

KAVYANSHTYAGI commented Sep 9, 2025

Description

Uh oh!

Uh oh!