[AutoParallel] fix GPT embedding placements #10834
PR types
Others
PR changes
Others
Description
Under GPT dynamic semi-auto parallel training, this PR adjusts placements to improve performance:
1. Change the placements of the embedding and lmhead layer weights to column-wise sharding (see the sketch after this list).
2. Change the output placements of the GPTEmbeddingsAuto layer to replicate.
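A minimal sketch of what column-wise weight sharding looks like with Paddle's semi-auto parallel API; the mesh layout, tensor shapes, and variable names here are illustrative assumptions, not the PR's actual code:

```python
import paddle
import paddle.distributed as dist

# Hypothetical 2-D mesh: axis 0 = data parallel (dp), axis 1 = model parallel (mp).
# Running this for real requires 8 processes launched via paddle.distributed.launch.
mesh = dist.ProcessMesh([[0, 1, 2, 3], [4, 5, 6, 7]], dim_names=["dp", "mp"])

vocab_size, hidden_size = 50304, 1024  # illustrative sizes

# Embedding weight [vocab_size, hidden_size]: column-wise sharding splits the
# hidden (column) dimension across the mp axis -> Shard(1).
emb_weight = paddle.randn([vocab_size, hidden_size])
emb_weight = dist.shard_tensor(emb_weight, mesh, [dist.Replicate(), dist.Shard(1)])

# lmhead weight [hidden_size, vocab_size] (assumed layout): column-wise sharding
# splits the vocab (column) dimension across the mp axis -> Shard(1).
lmhead_weight = paddle.randn([hidden_size, vocab_size])
lmhead_weight = dist.shard_tensor(lmhead_weight, mesh, [dist.Replicate(), dist.Shard(1)])
```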
Before the change: GPTEmbeddingsAuto produced outputs with placements [Replicate(), Shard(2)], so the input to every encoder layer also arrived as [Replicate(), Shard(2)]. The residual + dropout computation then had to convert [Replicate(), Shard(2)] -> [Replicate(), Replicate()], which introduced allgather communication in both the forward and backward passes (sketched below).

GPT, single node, 8 H-series GPUs, best config, 20 layers, throughput (tokens/card/s): 7902.79
After the change: GPTEmbeddingsAuto outputs [Replicate(), Replicate()], so the encoder layers also receive replicated inputs, and the residual + dropout computation runs directly on replicated tensors without any extra communication.

GPT, single node, 8 H-series GPUs, best config, 20 layers, throughput (tokens/card/s): 8305.75 (+402.96, +5.1%)