Skip to content

Commit 8561a71

Browse files
[fix] improve Sglang kt-kernel detect time duration (#1887)
* Increase timeout for Check if --kt-gpu-prefill-token-threshold is in the help output to 90 seconds. In cloud environments,CUDA initialization and Python module loading can easily exceed 30 seconds. * Update kt-kernel/python/cli/utils/sglang_checker.py add comment about the change Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
1 parent 7a4b9b0 commit 8561a71

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

kt-kernel/python/cli/utils/sglang_checker.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -324,7 +324,7 @@ def check_sglang_kt_kernel_support(use_cache: bool = True, silent: bool = False)
324324
[sys.executable, "-m", "sglang.launch_server", "--help"],
325325
capture_output=True,
326326
text=True,
327-
timeout=30,
327+
timeout=90, # Increased for slow CUDA init and module loading in some environments
328328
)
329329

330330
help_output = result.stdout + result.stderr

0 commit comments

Comments
 (0)