Skip to content

Commit 3c4a55e

Browse files
authored
[bugfix] fix megatron kto pp + sp (#7882)
1 parent 79e49da commit 3c4a55e

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

swift/megatron/trainers/kto_trainer.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -128,6 +128,8 @@ def forward_step(self, data_iterator, model):
128128
kl_data.pop('loss_scale', None)
129129

130130
length = self._get_kto_length(data)
131+
if self.args.sequence_parallel:
132+
length //= mpu.get_tensor_model_parallel_world_size()
131133
with torch.no_grad(), self.null_ref_context() as ref_models:
132134
ref_model = ref_models[vp_stage or 0]
133135
if self.args.calculate_KL:

0 commit comments

Comments
 (0)