A minimal implementation of Fully Sharded Data Parallel (FSDP) in the style of PyTorch FSDP2.
At a high level, parameters are unsharded (all-gathered) on demand for computation and resharded afterwards, during both the forward and backward passes.
- Drop-in `fully_shard()` wrapper for PyTorch modules (see the usage sketch below)
- Automatic sharding/unsharding during training
- Minimal reference implementation (~1 file, <300 LOC)
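A minimal usage sketch of how the wrapper might be applied, intended to run under `torchrun` as in the quick-start commands below. The import path `src.fsdp`, the exact `fully_shard()` signature, and the in-place sharding behavior are assumptions rather than the repo's confirmed API:

```python
import torch
import torch.distributed as dist
import torch.nn as nn

from src.fsdp import fully_shard  # assumed import path

# One process per GPU, launched via torchrun.
dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = nn.Sequential(nn.Linear(1024, 1024), nn.ReLU(), nn.Linear(1024, 1024)).cuda()
fully_shard(model)  # assumed to shard parameters in place across all ranks

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
for _ in range(10):
    # Standard training step; the wrapper is expected to unshard parameters
    # for the forward/backward computation and reshard them afterwards.
    loss = model(torch.randn(8, 1024, device="cuda")).pow(2).mean()
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

dist.destroy_process_group()
```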
```bash
# Install dependencies using uv
uv sync
source .venv/bin/activate

# Run training example (2 GPUs)
torchrun --nproc_per_node=2 train.py
```
```bash
# Install dependencies using uv
uv sync

# Run all tests
uv run pytest -v
```
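For orientation, a hypothetical parity-style test is sketched below. It is not taken from `tests/test_fsdp.py`; the single-rank `gloo` setup and the in-place `fully_shard()` behavior are assumptions:

```python
import copy

import torch
import torch.distributed as dist
import torch.nn as nn

from src.fsdp import fully_shard  # assumed import path


def test_forward_parity_single_rank():
    # Single-process "gloo" group so the sketch runs without GPUs or torchrun.
    if not dist.is_initialized():
        dist.init_process_group(
            "gloo", init_method="tcp://127.0.0.1:29501", rank=0, world_size=1
        )

    torch.manual_seed(0)
    model = nn.Sequential(nn.Linear(16, 16), nn.ReLU(), nn.Linear(16, 4))
    reference = copy.deepcopy(model)
    fully_shard(model)  # assumed to shard parameters in place

    # With a single rank, the sharded model should match the unsharded copy.
    x = torch.randn(2, 16)
    torch.testing.assert_close(model(x), reference(x))
```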
```
.
├── src/
│   └── fsdp.py          # Core FSDP implementation
├── tests/
│   └── test_fsdp.py     # Unit tests for FSDP
├── train.py             # Example training script with LLaMA
└── README.md
```