Commit 84a1531

[TRTLLM-9488][feat] use FlashInfer.sampling by default (#9545)
Signed-off-by: ixlmar <[email protected]>
1 parent 3e4f238 commit 84a1531

2 files changed: 2 additions and 2 deletions


tensorrt_llm/llmapi/llm_args.py

Lines changed: 1 addition & 1 deletion
@@ -2719,7 +2719,7 @@ class TorchLlmArgs(BaseLlmArgs):
     _quant_config: Optional[QuantConfig] = PrivateAttr(default=None)
 
     disable_flashinfer_sampling: bool = Field(
-        default=True,
+        default=False,
         description=
         "Disable the use of FlashInfer.sampling. This option is likely to be removed in the future.",
         status="prototype",
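
Because the field is a "disable" switch, changing its default from True to False turns FlashInfer sampling on unless a caller opts out. The sketch below illustrates that inversion with a stand-in dataclass. `SketchLlmArgs` and `uses_flashinfer_sampling` are hypothetical names introduced only for illustration; they are not part of TensorRT-LLM, and the real `TorchLlmArgs` carries many more fields.

```python
from dataclasses import dataclass


@dataclass
class SketchLlmArgs:
    """Hypothetical stand-in for TorchLlmArgs; models only the flag changed here."""

    # Default after this commit: sampling via FlashInfer is not disabled.
    disable_flashinfer_sampling: bool = False


def uses_flashinfer_sampling(args: SketchLlmArgs) -> bool:
    """Hypothetical helper: the effective setting is the negation of the flag."""
    return not args.disable_flashinfer_sampling


# Relying on the new default enables FlashInfer sampling.
default_args = SketchLlmArgs()

# Opting out explicitly reproduces the pre-commit default.
opted_out = SketchLlmArgs(disable_flashinfer_sampling=True)
```

Assuming the real field is set like any other `TorchLlmArgs` field, passing `disable_flashinfer_sampling=True` should restore the earlier behavior for as long as the option exists; its description warns that it is likely to be removed.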

tests/unittest/api_stability/references/llm.yaml

Lines changed: 1 addition & 1 deletion
@@ -109,7 +109,7 @@ methods:
         status: beta
       disable_flashinfer_sampling:
         annotation: bool
-        default: True
+        default: False
         status: prototype
       moe_config:
         annotation: tensorrt_llm.llmapi.llm_args.MoeConfig
