Skip to content

Commit 97cfa65

Browse files
hmellorIsotr0py
andauthored
Add pipeline parallel support to TransformersModel (#12832)
Signed-off-by: Harry Mellor <[email protected]> Signed-off-by: Isotr0py <[email protected]> Co-authored-by: Isotr0py <[email protected]>
1 parent 911c8eb commit 97cfa65

File tree

4 files changed

+245
-88
lines changed

4 files changed

+245
-88
lines changed

docs/source/models/supported_models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ The Transformers fallback explicitly supports the following features:
7373

7474
- <project:#quantization-index> (except GGUF)
7575
- <project:#lora-adapter>
76-
- <project:#distributed-serving> (pipeline parallel coming soon <gh-pr:12832>!)
76+
- <project:#distributed-serving> (requires `transformers>=4.49.0`)
7777

7878
#### Remote code
7979

tests/distributed/test_pipeline_parallel.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -175,6 +175,8 @@ def iter_params(self, model_id: str):
175175
"inceptionai/jais-13b-chat": PPTestSettings.fast(),
176176
"ai21labs/Jamba-tiny-dev": PPTestSettings.fast(),
177177
"meta-llama/Llama-3.2-1B-Instruct": PPTestSettings.detailed(),
178+
# Tests TransformersModel
179+
"ArthurZ/Ilama-3.2-1B": PPTestSettings.fast(),
178180
"openbmb/MiniCPM-2B-sft-bf16": PPTestSettings.fast(),
179181
"openbmb/MiniCPM3-4B": PPTestSettings.fast(),
180182
# Uses Llama
@@ -243,6 +245,7 @@ def iter_params(self, model_id: str):
243245
# [LANGUAGE GENERATION]
244246
"microsoft/Phi-3.5-MoE-instruct",
245247
"meta-llama/Llama-3.2-1B-Instruct",
248+
# "ArthurZ/Ilama-3.2-1B", NOTE: Uncomment after #13905
246249
"ibm/PowerLM-3b",
247250
# [LANGUAGE EMBEDDING]
248251
"intfloat/e5-mistral-7b-instruct",

0 commit comments

Comments
 (0)