Skip to content

Commit b34ad11

Browse files
authored
fix
1 parent e4e222f commit b34ad11

File tree

1 file changed

+2
-1
lines changed
  • lightllm/server/router/model_infer/mode_backend/chunked_prefill

1 file changed

+2
-1
lines changed

lightllm/server/router/model_infer/mode_backend/chunked_prefill/impl.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,8 @@ def post_handel(self, run_reqs: List[InferReq], next_token_ids, next_token_logpr
5454

5555
req_obj.cur_kv_len = len(req_obj.get_chuncked_input_token_ids())
5656
if req_obj.cur_kv_len < req_obj.get_cur_total_len():
57-
req_obj.shm_req.shm_cur_kv_len = req_obj.cur_kv_len
57+
if self.tp_rank < self.dp_size:
58+
req_obj.shm_req.shm_cur_kv_len = req_obj.cur_kv_len
5859
continue
5960

6061
req_obj.set_next_gen_token_id(next_token_id, next_token_logprob)

0 commit comments

Comments
 (0)