【feature】support n parameter #4273
base: develop
Conversation
Thanks for your contribution!
request["prompt_token_ids_len"] = len(request["prompt_token_ids"]) | ||
input_ids_len = request["prompt_token_ids_len"] | ||
request["max_tokens"] = min(self.max_model_len - input_ids_len, request.get("max_tokens")) | ||
if request.get("reasoning_max_tokens", None) is None: |
Please remove the reasoning_max_tokens logic here.
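A minimal sketch of what this comment appears to ask for (assumed, not verified against the full file): keep the max_tokens clamp from the diff above and drop the reasoning_max_tokens branch entirely.

```python
# Sketch of the suggested edit: keep the max_tokens clamp, remove the
# reasoning_max_tokens branch. Names are taken from the diff above.
request["prompt_token_ids_len"] = len(request["prompt_token_ids"])
input_ids_len = request["prompt_token_ids_len"]
request["max_tokens"] = min(self.max_model_len - input_ids_len, request.get("max_tokens"))
```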
chunk_object_type: str = "chat.completion.chunk"
first_iteration = True
previous_num_tokens = 0
n_param = request.n if request.n is not None else 1
Just use num_choices directly; there is no need to add a separate n_param.
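A hedged sketch of how this could look if num_choices is reused here. Per-choice lists are an assumption suggested by the later `first_iteration[idx]` update, not something stated in this hunk.

```python
# Sketch of the suggestion: compute num_choices once from request.n and reuse
# it for per-choice state, instead of introducing a separate n_param variable.
num_choices = request.n if request.n is not None else 1
first_iteration = [True] * num_choices
previous_num_tokens = [0] * num_choices
```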
first_iteration[idx] = False

output = res["outputs"]
reasoning_content = output["reasoning_content"]
Delete this line.
delta_message = DeltaMessage(
    reasoning_content="",
    reasoning_content=reasoning_content,
Change this line back to reasoning_content="" as well.
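Taken together with the comment above, the suggested state of this hunk would look roughly like the following sketch (other DeltaMessage fields omitted):

```python
# Sketch of the suggested revert: do not read reasoning_content from the
# output here, and keep the DeltaMessage field empty as before.
output = res["outputs"]
delta_message = DeltaMessage(
    reasoning_content="",
)
```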
prompt_tokens_details=PromptTokenUsageInfo(cached_tokens=final_res.get("num_cached_tokens", 0)),
prompt_tokens_details=PromptTokenUsageInfo(cached_tokens=sum(num_cached_tokens)),
)
work_process_metrics.e2e_request_latency.observe(time.time() - final_res["metrics"]["request_start_time"])
Just move this line inside the loop; there is no need to track the latency separately.
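A rough sketch of the suggested placement. The loop and the name final_res_batch are assumptions about the surrounding non-streaming handler; only the observe call itself comes from the diff.

```python
# Sketch: record end-to-end latency inside the per-choice loop rather than
# once after it, as the reviewer suggests. The loop variable name is assumed.
for final_res in final_res_batch:
    ...
    work_process_metrics.e2e_request_latency.observe(
        time.time() - final_res["metrics"]["request_start_time"]
    )
```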
request_prompts = request_prompt_ids

num_choices = len(request_prompts)
num_choices = len(request_prompts) * request.n
request.n needs a None check; the `n_param = current_req_dict.get("n", 1)` from below can be moved up here, and num_choices should be multiplied by n_param instead.
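A minimal sketch of the suggested fix, assuming n_param is hoisted above the num_choices computation exactly as the comment describes:

```python
# Sketch of the reviewer's suggestion: guard against request.n being None by
# hoisting n_param above num_choices and multiplying by it.
n_param = request.n if request.n is not None else 1
num_choices = len(request_prompts) * n_param
```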
try:
    for idx, prompt in enumerate(request_prompts):
        request_id_idx = f"{request_id}-{idx}"
        request_id_idx = f"{request_id}"
This line can be deleted.
for idx, prompt in enumerate(request_prompts):
    request_id_idx = f"{request_id}-{idx}"
    request_id_idx = f"{request_id}"
    current_req_dict = request.to_dict_for_infer(request_id_idx, prompt)
Just pass request_id directly as the argument here.
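Combining the two comments above, the suggested shape of this loop would be roughly as follows (a sketch; surrounding error handling omitted):

```python
# Sketch: drop the request_id_idx indirection and pass request_id straight
# through, as the reviewer suggests.
for idx, prompt in enumerate(request_prompts):
    current_req_dict = request.to_dict_for_infer(request_id, prompt)
```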
Support adding the parameter `n` to the request to retrieve multiple model responses.
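As an illustration of the feature, a client request might look like the following. This is a hypothetical example against an OpenAI-compatible chat completions endpoint; the URL, API key, and model name are placeholders, not taken from this PR.

```python
# Hypothetical client-side usage of the new `n` parameter against an
# OpenAI-compatible endpoint; base_url and model name are placeholders.
import openai

client = openai.OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="default",
    messages=[{"role": "user", "content": "Write a one-line haiku about rivers."}],
    n=2,  # ask the server for two independent completions
)

for choice in response.choices:
    print(choice.index, choice.message.content)
```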