[QEff. Finetuning]: Adding tests for PP in HF trainer stack#817

Open
quic-swatia wants to merge 3 commits into quic:ft_experimental from quic-swatia:pp-test

Conversation

@quic-swatia
Contributor

No description provided.

Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
Contributor

@quic-akuruvil quic-akuruvil left a comment


Please correct the lint error.

assert not overlap, f"Stages {s_idx} and {t_idx} share layers {overlap} – stages must be disjoint."

# --- 5. Balance: each stage has base or base+1 layers -----------------
base, remainder = divmod(num_layers, pp_degree)
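The disjointness assertion quoted above can be exercised in isolation. The sketch below is illustrative only (the `stages` layout is a made-up example, not the actual test data):

```python
# Minimal illustration of the pairwise disjointness check quoted above.
stages = [[0, 1, 2], [3, 4], [5, 6]]  # hypothetical layer indices per stage

for s_idx in range(len(stages)):
    for t_idx in range(s_idx + 1, len(stages)):
        overlap = set(stages[s_idx]) & set(stages[t_idx])
        assert not overlap, (
            f"Stages {s_idx} and {t_idx} share layers {overlap} - "
            "stages must be disjoint."
        )
```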
Contributor


How is balancing ensured here?

Contributor


What is the strategy/logic used for splitting model layers across the devices?

Contributor Author


Consider an example: num_layers = 22, num_stages = 5.
With 'base, remainder = divmod(num_layers, pp_degree)', base and remainder turn out to be 4 and 2, respectively.

Lines #134-139 ('expected_count = base + (1 if stage_idx < remainder else 0)') check that each of the first two (remainder) devices has 5 (base + 1) layers, and each of the last 3 (num_stages - remainder) devices has 4 (base) layers, hence ensuring balancing amongst devices.
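The balance logic described above can be sketched as follows (a minimal standalone illustration, not the actual test code):

```python
# Worked example of the balance check: 22 layers over 5 pipeline stages.
num_layers, pp_degree = 22, 5
base, remainder = divmod(num_layers, pp_degree)  # base=4, remainder=2

# First `remainder` stages get base + 1 layers; the rest get base layers.
counts = [base + (1 if stage_idx < remainder else 0)
          for stage_idx in range(pp_degree)]

assert counts == [5, 5, 4, 4, 4]
assert sum(counts) == num_layers  # every layer is assigned exactly once
```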


@quic-akuruvil quic-akuruvil left a comment


All these test cases are passing locally?


@quic-akuruvil quic-akuruvil left a comment


Test cases look good, with extensive coverage.


@quic-akuruvil quic-akuruvil left a comment


QAIC_VISIBLE_DEVICES=0 python -m pytest QEfficient/finetune/experimental/tests/
Currently this is how we run the existing tests. Based on this, mention in the docs how to run the PP tests, and include sample commands too.

Ideally, all tests in the project should run when the above command is executed, so let's keep it like that.

Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
@quic-swatia
Contributor Author

> All these test cases are passing locally?

Yes, all of them are passing locally.

@quic-swatia
Contributor Author

quic-swatia commented Mar 25, 2026

> QAIC_VISIBLE_DEVICES=0 python -m pytest QEfficient/finetune/experimental/tests/ Currently this is how we run the existing tests. So based on this mention how to run PP tests. Include sample commands too, in docs
>
> Ideally all tests in the project should run when above command is executed. So lets keep it like that

2 tests in this file require # visible devices = 2. If 1 is passed, those 2 tests are skipped. These two tests also run when QAIC_VISIBLE_DEVICES is not set at all and the machine has >= 2 devices.
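The skip behavior described above could be implemented along these lines. This is a hypothetical sketch: the helper name `get_num_visible_qaic_devices` and the fallback value are illustrative assumptions, not the actual test code, and the resulting flag would typically feed a `pytest.mark.skipif` marker:

```python
import os

def get_num_visible_qaic_devices(default: int = 2) -> int:
    # Hypothetical helper: count devices listed in QAIC_VISIBLE_DEVICES
    # (e.g. "0" or "0,1"). When the variable is unset, fall back to the
    # machine's own device count; `default` stands in for that probe here.
    visible = os.environ.get("QAIC_VISIBLE_DEVICES")
    if visible is None:
        return default
    return len([d for d in visible.split(",") if d.strip()])

# The two PP tests would be skipped when fewer than 2 devices are visible.
skip_pp_tests = get_num_visible_qaic_devices() < 2
```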

Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>