Commit 2d81be7

fix
1 parent ef35cf6 commit 2d81be7

1 file changed

+0 -1 lines changed

lightllm/server/router/model_infer/mode_backend/generic_post_process.py

Lines changed: 0 additions & 1 deletion
@@ -81,7 +81,6 @@ def sample(logits: torch.Tensor, reqs: List[InferReq], eos_id: List[int] = [2]):
         )
         int64_batch_next_token_ids = torch.empty_like(batch_next_token_ids, dtype=torch.int64)
         int64_batch_next_token_ids[:] = batch_next_token_ids
-        batch_next_token_ids = int64_batch_next_token_ids.cuda(non_blocking=True)
         batch_next_token_probs = torch.gather(probs, dim=1, index=int64_batch_next_token_ids.view(-1, 1))
         return batch_next_token_ids.view(-1), torch.log(batch_next_token_probs).view(-1)
     else:
