[fix] improve Sglang kt-kernel detect time duration (#1887)

YIFANCHENGDU · gemini-code-assist[bot] · web-flow · commit 8561a71dd11e · 2026-03-18T23:07:40.000+08:00
* Increase timeout for Check if --kt-gpu-prefill-token-threshold is in the help output to 90 seconds.

In cloud environments,CUDA initialization and Python module loading can easily exceed 30 seconds.

* Update kt-kernel/python/cli/utils/sglang_checker.py

add comment about the change

Co-authored-by: gemini-code-assist[bot] &lt;176961590+gemini-code-assist[bot]@users.noreply.github.com&gt;

---------

Co-authored-by: gemini-code-assist[bot] &lt;176961590+gemini-code-assist[bot]@users.noreply.github.com&gt;
diff --git a/kt-kernel/python/cli/utils/sglang_checker.py b/kt-kernel/python/cli/utils/sglang_checker.py
@@ -324,7 +324,7 @@ def check_sglang_kt_kernel_support(use_cache: bool = True, silent: bool = False)
             [sys.executable, "-m", "sglang.launch_server", "--help"],
             capture_output=True,
             text=True,
-            timeout=30,
+            timeout=90,  # Increased for slow CUDA init and module loading in some environments
         )
 
         help_output = result.stdout + result.stderr

Original file line number	Diff line number	Diff line change
`@@ -324,7 +324,7 @@ def check_sglang_kt_kernel_support(use_cache: bool = True, silent: bool = False)`
`324`	`324`	`[sys.executable, "-m", "sglang.launch_server", "--help"],`
`325`	`325`	`capture_output=True,`
`326`	`326`	`text=True,`
`327`		`- timeout=30,`
	`327`	`+ timeout=90, # Increased for slow CUDA init and module loading in some environments`
`328`	`328`	`)`
`329`	`329`
`330`	`330`	`help_output = result.stdout + result.stderr`