
Commit 743ae21

turn off plugin on deepseek models to prevent error
1 parent 2ce535f commit 743ae21


2 files changed: 2 additions, 2 deletions


deepseek/engine-deepseek-r1-distill-qwen-14b/config.yaml

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@ trt_llm:
 tensor_parallel_count: 1
 plugin_configuration:
 use_paged_context_fmha: true
-use_fp8_context_fmha: true
+use_fp8_context_fmha: false
 paged_kv_cache: true
 runtime:
 batch_scheduler_policy: max_utilization

deepseek/engine-deepseek-r1-distill-qwen-32b/config.yaml

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@ trt_llm:
 tensor_parallel_count: 1
 plugin_configuration:
 use_paged_context_fmha: true
-use_fp8_context_fmha: true
+use_fp8_context_fmha: false
 paged_kv_cache: true
 runtime:
 batch_scheduler_policy: max_utilization
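
Both files receive the same one-line change, so a small check can help confirm that no config still enables the flag. The following is a minimal sketch, not part of the commit: the helper name `fp8_context_fmha_enabled` is hypothetical, the indentation in the sample text is assumed, and it scans lines with a regular expression instead of parsing YAML so that it needs only the standard library.

```python
import re

# Hypothetical helper, not part of the commit. Matches a line of the form
# "use_fp8_context_fmha: <value>", optionally followed by a trailing comment.
FLAG_RE = re.compile(
    r"^\s*use_fp8_context_fmha:\s*(\S+)\s*(?:#.*)?$",
    re.MULTILINE,
)


def fp8_context_fmha_enabled(config_text: str) -> bool:
    """Return True if any use_fp8_context_fmha line is set to a literal true."""
    return any(
        match.group(1).lower() == "true"
        for match in FLAG_RE.finditer(config_text)
    )


# Sample text modelled on the diff; the nesting shown here is an assumption.
before = (
    "plugin_configuration:\n"
    "  use_paged_context_fmha: true\n"
    "  use_fp8_context_fmha: true\n"
    "  paged_kv_cache: true\n"
)
after = before.replace(
    "use_fp8_context_fmha: true", "use_fp8_context_fmha: false"
)

print(fp8_context_fmha_enabled(before))
print(fp8_context_fmha_enabled(after))
```

The pattern is anchored on the full key name, so `use_paged_context_fmha: true` is not counted. A line-based scan will not see values written in other YAML forms (for example flow mappings), so a real check would likely load the file with a YAML parser instead.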
