
Commit 64bc002

Clean up comment
1 parent 2f1e020 commit 64bc002

1 file changed, +1 -1 lines changed

trinity/trainer/verl/dp_actor.py

Lines changed: 1 addition & 1 deletion
@@ -118,7 +118,7 @@ def update_policy(self, data: DataProto):  # noqa: C901
 # calculate the total number of response tokens in the minibatch
 mini_batch_token_num = torch.sum(
     mini_batch.batch["response_mask"].to(get_device_id())
-).item()  # TODO: double check this calculation
+).item()

 self.actor_optimizer.zero_grad()
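The calculation whose TODO this commit removes can be illustrated with a small torch-free sketch. It assumes (this is not confirmed by the diff) that response_mask is a 0/1 matrix of shape (batch, response_length), with 1 marking a real response token and 0 marking padding. The helper name count_response_tokens and the sample mask are hypothetical and are not part of dp_actor.py.

```python
def count_response_tokens(response_mask):
    """Sum every entry of a 0/1 mask given as a list of rows.

    Under the stated assumption, this matches what summing the mask
    tensor and calling .item() would return as a Python number.
    """
    return sum(sum(row) for row in response_mask)


# Hypothetical mini-batch: three responses padded to length 5.
response_mask = [
    [1, 1, 1, 0, 0],  # 3 response tokens
    [1, 1, 1, 1, 1],  # 5 response tokens
    [1, 0, 0, 0, 0],  # 1 response token
]

mini_batch_token_num = count_response_tokens(response_mask)
print(mini_batch_token_num)  # 9
```

If the mask holds only zeros and ones, the sum equals the number of unpadded response tokens in the mini-batch, which is what the comment on line 118 describes.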
